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DEFECTS IN DRUG METABOLISM 

FIELD OF THE INVENTION 
The invention relates to genetic material, 
specifically primers, for use in a method designed to 
^ determine the genotype of an individual; and also a kit, 
including the genetic material of the invention, for 
performing the method of the invention. 

BACKGROUND OF THE INVENTION 
10 It is well known that genetic polymorphisms in 

drug metabolizing genes give rise to a variety of 
phenotypes. This information has been used to advantage 
in the past for developing genetic assays that predict 
phenotype and thus predict an individual's ability to 
15 metabolize a given drug. The information is of particular 
value in determining the likely side effects and 
therapeutic failures of various drugs. The availability 
of this sort of information will result in routine 
phenotyping being recommended for certain categories of 
20 patients. 

Drug metabolism is carried out by the cytochrome 
P450 family of enzymes. For example, the cytochrome P450 
isozyme gene, cyP2C9 encodes a high affinity hepatic 
[S] -warfarin 7-hydroxylase which appears to be principally 

25 responsible for the metabolic clearance of the most potent 
enantiomer of warfarin. Similarly, the cytochrome P450 
isozyme gene, CYP2A6, encodes a protein that metabolizes 
nicotine and coumarin and activates the tobacco-specific 
nitrosamine 4- (methyinitrosamino) -1- ( 3-pyridyl) - 

30 l-butanone) (NNK) . 

It is of note that the above gene products are 
also known to metabolize other substrates, for example, 
the cyP2C9 gene product is also known to metabolize 
Tolbutamide, Phenytoin, Ibuprofen, Naproxen, Tienilic 

35 acid. Diclofenac and Tetrahydrocannabinol . 

It follows that genetic polymorphisms or 
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mutations in either of the two aforementioned genes can 
lead to an impairment in metabolism of at least the 
aforementioned drugs. 

In so far as CYP2C9 is concerned^ sequences 
reported by Yasumori et al (1987 J. Blochem. 102:1075- 
^ 1082.) and Kimura et al (1987 Wuc. Acids Res. 15:10053- 

10054) show differences at several positions including a C 
to T base change that results in a Arginine/Cysteine 
polymorphism at amino acid 144. This polymorphism has 
been designated R144C. 
1^ In so far as CYP2A6 is concerned, a T to A base 

change at position 488 of the cDNA sequence described by 
Yamano et al (1990 Biochemistry 29:1322-1329) results in 
substitution of Leucine 160 by Histidine. Henceforth this 
mutant form of the gene will be designated CYP2A6vl. 
15 The variant CYP2A6vl encodes an enzyme that is 

unstable and catalytically inactive. It is found in the 
general population at a frequency of about 1% but does not 
account for all slow metabolizers of coumarin. 

Since the cDNA sequence structure of CYP2C9 and 
20 CYP2A6 are known, and since it is also known to perform 

genetic assays to determine whether a preselected mutation 
is present within a given gene, it should, in theory, be 
possible to design assays which specifically determine 
whether either of the aforementioned mutations are present 
25 in each of the respective aforementioned genes. 

However, we have found an extraordinarily high 
degree of exon homology in the cytochrome P450 genes. 
This has resulted in non-specific binding of assay 
materials and poor performance of assays. In the instance 
30 where primers have been used to hybridize to genetic 

material, non-specific binding of such primers has taken 
place, and in the further instance where primers have been 
used to hybridize to genetic material with a view to 
performing a polymerase chain reactions we have found that 
35 related genes have also been amplified, for example, 
CYP2A7, CYP2A12 and CYP2C8 have also been amplified. 



wo 95/34679 



PCT/US95;07605 



- 3 - 

SUMMARY OF THE INVENTION 

The present invention relates to novel variant ^ 

alleles in cytochrome P450 genes which express enzymes 
involved in the metabolism of particular drugs and/ or 
chemical carcinogens- 

^ One object of the present invention relates to 

the discovery of new mutant or variant CYP2A6 alleles 
wherein the human gene is characterized. A new variant 
allele has been found which is designated CYP2A6v2. The 
cDNA and genomic sequence of CYP2A6v2 is provided in the 
present invention. Another new gene related to 
CYP2A6 has been discovered and is designated CYP2A13. The 
cDNA and genomic sequence of CYP2A13 is provided in the 
present invention* 

Another object of the present invention relates 

15 to the use of intron sequences to specifically identify 
CYP2A6 and CYP2C9 variants in a gene specif ic detection 
assay. 

Another object of the present invention is to 
use an oligonucleotide probe, specific for regions unique 
20 to a particular CYP2 variant to screen for the presence or 
absence of the variant in a sample. 

Yet another object of the invention is to 
provide genetic material, a method, and a kit which enable 
genotyping of the CYP2C9 and CYP2A6 gene with a view to 
25 providing phenotypic information concerning drug 
metabolism. 

A further object of the present invention 
provides a method for diagnostically determining the 
sensitivity of a patient for specific drugs and chemical 
30 carcinogens- Such a method is widely applicable in 

determining the proper dosage of a drug for a patient. 

Another object of the present invention provides 
a method of genotyping CYP2A6 and CYP2C9 and determining 
whether a mutation has altered the sequence of these genes 
35 and hence altered sensitivity to particular drugs and 
chemical carcinogens . 
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In accordance with the present invention a 
method is provided which utilizes the finding that each 
variant of a CYP2 gene has specific nucleotide differences 
as compared with the wild-type CYP2 gene. Such nucleotide 
changes can be utilized in a probe-hybridization assay, 
which is capable of specifically detecting a chosen 
variant and not other variants. 

The present invention also provides a genotyping 
method for identifying the presence or absence of a 
mutation at codon 144 of the coding sequence of CYP2C9, or 
alternatively, at codon 160 of the coding sequence of 
CYP2A6, or alternatively, a gene conversion event 
involving CYP2A6 and CYP2A7 in exons 3, 6 or 8 comprising 
use of a portion of DNA. Such a mutation is then 
correlated to the sensitivity of particular drugs and 
chemical carcinogens. 

The present invention further relates to a gene- 
specific bioassay which is capable of distinguishing 
between the CYP2 genes and identify the presence oir 
absence of a mutation in CYP2A6 and cyP2C9 genes. Such a 
bioassay can diagnostically predict the sensitivity of an 
individual to particular drugs or chemical carcinogens. 
For example, the CYP2C9 variants identify a sensitivity to 
a commonly used anti-coagulant drug, warfarin. The CYP2A6 
variants identify sensitivity to coumarin, nicotine and 
nitrosamines. The sensitivity to nicotine may be used to 
predict a predisposition to tobacco-related diseases, a 
propensity to smoking and adverse reactions to exposure to 
nicotine. Further, CYP2A6 genes are associated with the 
activation of nitrosamines, elevated levels of which have 
been correlated with many cancers. 

The present invention also provides a method of 
genotyping the CYP2A6 and CYP2C9 genes using ailele- 
specific amplification reaction. 

In addition, a highly-specific combination 
genotyping bioassay has been developed to identify 
mutations within CYP2A6 and CYP2C9 which are linked to 
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sensitivity to particular drugs and chemical carcinogens. 
This combination bioassay comprises a gene-specific 
amplification reaction, an exon-specif ic amplification 
reaction and an endonuclease cleavage reaction wherein 
only one form, either mutant or wild-type is cleaved, 
^ producing either a single nucleic acid fragment or 
multiply nucleic acid fragments depending upon the 
presence or absence of the mutation. For example, one 
CYP2C9 variant, R144C, which contains a C^jz^T mutation can 
be identified by an Avail restriction site. CYP2A6 
variants can also be identified by their corresponding 
mutations. CYP2A6vl which contains a T^sa"*^ mutation can 
be identified by a Xcjnl restriction site. CYP2A6v2 which 
contains a T^^s-^A mutation can be identified by a Pdel 
restriction site. 
15 rp);ie present invention also relates to a method 

for screening patients for drug sensitivity prior to their 
treatment with that drug, thereby alerting a physician of 
a drug sensitivity. In addition, the method may be used to 
screen patients for a predisposition to cancers related to 
20 excessive nitrosamine activation, which are associated 

with mutations within the CYP2A6 gene locus. Further, the 
method may be used to screen patients for a sensitivity to 
chemical carcinogens, based upon the genotype of the 
CYP2A6 and/or CYP2C9 alleles. 
25 One such new allele variant, CYP2A6v2, has 98% 

nucleotide similarity and 80% amino acid similarity with 
the wild type CYP2A6, respectively. The present invention 
relates to the new CYP2A6v2 variant, the cDNA sequence and 
its genomic sequence wherein the alterations in sequence 
30 are within exons 3, 6 and 8, which are attributed to a 
gene conversion. In addition, another new gene, also 
involved in drug metabolism has been identified, and has 
been designated CYP2A13. This gene plays a similar role 
in drug metabolism as CYP2A6. These new gene sequences or 
35 fragments thereof are used as probes in identifying 
specific CYP2 variants in samples. In additions. 
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fragments of the new genes are used as primers in a 
genotyping assay. 

The invention further provides isolated CYP2Av2 
and CYP2A13 cDNAs for use in gene therapy and replacement 
protocols for individuals who are predisposed to 
^ sensitivity to needed drugs or to chemical or 
environmental carcinogens . 

In accordance with an aspect of the present 
invention, there are provided primary human cells which 
are genetically engineered with CYP2A6V2 or CyP2A13 DNA 
(HKA) which encodes a therapeutic agent of interest, and 
the genetically engineered cells are employed as a 
therapeutic agent. (The term "therapeutic," as used 
herein, includes treatment and/or prophylaxis.) 

Gene expression in an organism in accordance 
with the practices of this invention is regulated, 
inhibited and/or controlled by incorporating in or along 
with the genetic material of the organism non-native DNA 
which transcribes to produce an RNA which is complementary 
to and capable of binding or hybridizing to a mRNA 
20 produced by a gene located within said organism. Upon 

binding to or hybridization with the mRNA, the translation 
of the mRNA is prevented. Consequently, the protein coded 
for by the mRNA is not produced . In the instance where 
the toRNA translated product, e.g. protein, is vital to the 
25 growth of the organism or cellular material, the organism 
is so transformed or altered such that it becomes, at 
least, disabled. 

Accordingly, in the practices of this invention 
from a genetic point of view as evidenced by gene 
30 expression, new organisms are readily produced. Further, 
the practices of this invention provide a powerful tool or 
technique for altering gene expression or organisms 
through gene therapy. The practices of this invention may 
cause the organisms to be disabled or incapable of 
35 functioning normally or may impart special properties 

thereto. The DNA of CYP2A6v2 or CYP2A13 employed in the 
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practices of this invention can be incorporated into the 
treated or effected organisms by direct introduction into 
the nucleus of a eukaryotic organism or by way of a 
plasmid or suitable vector containing the special DNA of 
this invention in the case of a procaryotic organism. 



BRXEF DESCRIPTION OF THE DRAWINGS 
Embodiments of the invention are described by 
way of example only with reference to the accompanying 
^® figures wherein: 

Fig. 1. Shows the sequence of exon 2, intron 2 
and exon 3 of CYP2C8 and CYP2C9, cDNA sequences (from 4) 
are shown at the top of the page together with sequences 
from 6 genomic clones encompassing exon 2, intron 2 and 
15 exon 3 of CYP2C8 and CYP2C9 . The position of the 

polymorphism at codon 144 of CYP2C9 and the PGR primers 
are indicated. 

Fig. 2, Shows the sequence of intron 2, exon 3 
and intron 3 of CYP2A6, CYP2A7 and CYP2A12 . The position 
20 of the polymorphism at codon 160 in cyP2A6 and the PGR 
primers are indicated. 

Fig. 3. Shows the detection of cyP2C9 Arg^^^ Cys 
polymorphism by PGR. Following amplification, samples 
were digested with Avail and analyzed on a 1.8 % agarose 
25 gel . Lane I and lanes 3 to 6 show homozygous wild-type 
subjects, lane 2 a heterozygous individual and lane 7 
undigested PGR product. 

Fig. 4. Shows detection of CYP2A6 Leu ^^g. His 
polymorphism by PGR. Two parallel PGR reactions were 
30 carried out and the products analyzed on a 1 % agarose 

gel. Lanes 1, 3, 5 and 7 show the results of the wild-type 
specific assay and lanes 2, 4, 6 and 8 the results of the 
variant-specific assay for the same four subjects. 
Subjects I and 2 (lanes 1-4) are homozygous wild-type, 
35 subject 3 (lanes 5 and 6) heterozygous and subject 4 

(lanes 7 and 8) homozygous for the mutation. 
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Fig. 5. Shows distribution of the weekly 
maintenance doses for warfarin in patients (n=57) 
homozygous for the CYP2C9 wild-type allele (open bars) and 
heterozygous (n=37) for the R144C mutant allele (solid 
bars) . Arrows show the median weekly dose requirement of 
^ warfarin for each genotype. 

Fig. 6. Represents 7 -hydroxy lat ion of coumarin 
(%) in a family genotyped for the CYP2A6 and cyP2A6vl 
alleles^ showing a subject homozygous for the CYP2A6V1 
allele who is deficient in coumarin 7-hydroxylation. 
10 Fig- 7. Shows the difference between the genomic 

and cDNA sequences for the CYP2A6 gene. 

Figs. 8a and b. Shows the conversion event which 
leads to the CYP2A6v2 allele. 

Figs. 9a through 9c. Shows the detection of 
15 CYP2A6V2 by PGR. (Fig. 9A) gene-specific amplification by 
PGR of the CYP2A6 gene using E3F and E3R* Lanes 1 to 4 
show the 7.8 Kb band obtained from several representative 
human genomic DNA templates, lane 5 correspond to a 
negative control in the absence of template and lane 6 
20 contains 1 Kb DNA ladder (GIBCO BRL) as six markers. 

(Fig. 9B) Exon-specif ic PGR amplification of exon 3 from 
the 7.8 Kb long-PCR product and restriction endonuclease 
pattern obtained after digestion with Xcml (left) and Ddel 
(right) to detect the CYP2A6vl and GYP2A6v2 alleles, 
25 respectively. The genotypes shown correspond to: wild type 
(+/+), heterozygous (+/-) and homozygous (-/-) subjects. 
(C) The genotyping strategy which has been developed. 
Exons are indicated by boxes. The position of the 
corresponding primer pairs are indicated by horizontal 
30 arrows. Xcjnl and Ddel restriction sites generate 

digestion patterns for the different alleles having 
fragment sizes as shown. 

Fig. 10. Schematic diagram depicting- 
methodology underlying a CYP2C9 genotyping assay. 
35 Fig. 11. CYP2A6V2 cDNA sequence. 

Fig. 12. CYP2A6V2 genomic DNA sequence having 
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7216 base pairs. 

Fig. 13. cyP2A13 cDNA sequence. ^ 
Fig. 14. CYP2A13 genomic DNA sequence having 
8779 base pairs. 

Fig. 15. Agarose minigel electrophoresis of PGR 
products. The CYP2C9 wild-type allele (Arg-144) and R144C 
respectively. Lanes marked "+/+" and "+/"" contain 
homozygous wild types and heterozygotes respectively. the^ 
right-hand lane contains a 100 bp ladder- 

DETAILED DESCRIPTION OF THE INVENTION 
The cytochrome P450 isozyme gene, CYP2C9 encodes 
a high affinity hepatic tS]-warfarin 7-hydroxylase which 
appears to be principally responsible for the metabolic 
clearance of the most potent enantiomer of warfarin along 
with metabolizing a number of other drugs and chemical 
carcinogens. Similarly, the cytochrome P450 isozyme gene, 
CYP2A6, encodes a protein that metabolizes nicotine, 
coumarin and a host of other drugs and chemical 
carcinogens CYP2A6 also activates the tobacco-specific 
nitrosamine 4- (methylnitrosamino) -1- ( 3-pyridyl) - 
20 i-butanone (herein referred to as "NNK") . Many cancers 
have been associated with activation and/or accumulation 
of nitrosamines. The present invention allows detection of 
a predisposition to such cancers. 

It is of note that the above gene products are 
25 also known to metabolize other substrates. For example, 
the CYP2C9 gene product is also known to metabolize 
Tolbutamide, Phenytoin, Ibuprofen, Imipramine, Naproxen, 
Tienilic acid. Diclofenac and Tetrahydrocannabinol and 
hence can also be used to detect sensitivities to these 
30 drugs. A list of CYP2C9 drug substrates has been 
documented and is incorporated herein by reference 
(Gonzalez & Idle 1994 Clin. 'PhsiTmsiCoVi±n&t 26:59-70). 
Hence, the present invention can be used to screen for 
sensitivities to these drugs. 
35 In addition, CYP2C9 has been associated with the 

metabolism of chemical carcinogens, such as polycyclic 
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aromatic hydrocarbons. For example, the most ubiquitous 
environmental carcinogen, benz-[a]-pyrene is metabolized 
by CYP2C9. Benz- [a] -pyrene is found in tobacco, barbecued 
meats, car exhaust and generally, in polluted air. This 
compound, as it accumulates in the body becomes a potent 
^ DNA intercalating agent, ultimately resulting in cell 

transformation and the fopnation of tumors. The present 
invention provides a diagnostic method of screening 
individuals for their ability to metabolize and hence 
inactivate benz- [a] -pyrene. For example, a homozygote 
wild-type CYP2C9 individual would be better able to 
tolerate high levels of benz- [a] -pyrene than a 
heterozygote of the cyP2C9 allele. 

Similarly, the CYP2A6 allele is associated with 
drug sensitivity and carcinogen metabolism. Coumarin 
sensitivity is directly related to the presence of a 
variant CYP2A6 allele, such as CYP2A6vl, CYP2A6v2 and also 
CYP2A13. Coumarin is a drug used in treatment of 
neoplastic diseases, such as lymphomas. (See Martindale; 
The Extra Pharmacopoeia 1993 Ed. Reynolds, J.E.F., The 

20 Pharmaceutical Press, London, p. 1358). Its suggested 

dosage is very high. Therefore, the present invention is 
useful in determining a patient's sensitivity to the drug 
in order to prescribe a proper dosage and avoid toxicity. 
Another drug, Thiotepa^ , is used in the 

25 treatment of a variety of neoplastic diseases, such as in 
treating women with breast cancer and children with brain 
tumors. Thiotepa is metabolized by CYP2A6 into Tepa, 
which is an intermediate more therapeutically potent than 
Thiotepa. Therefore, if a patient has a very active 

30 CYP2A6 enzyme, it is likely the patient will require lower 
doses of Thiotepa to provide a therapeutically effective 
amount. As one can see, the dosage provided to a patient 
is dependent upon the rate a patient is capable of 
metabolizing activating the drug. The present invention 

35 has identified variant alleles whose enzymatic activity is 
compromised. In addition, the present invention provides 
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a simple method of genotyping patients for Thiotepa drug 
sensitivity. With information concerning patient 
sensitivity to such drugs, the proper dosage can be 
provided, hence maximizing drug efficiency and minimizing 
drug toxicity. 

^ Further, CYP2A6 has been associated with 

nicotine metabolism. In addition to being an active 
ingredient in tobacco, nicotine also has several clinical 
uses. Nicotine is used clinically to treat various 
neurological disorders, such as Parkinson's disease and 
Alzheimer's disease. In addition, nicotine is used to 
treat tobacco addiction. In all of these situations, it 
is important to know a patient's sensitivity to nicotine, 
since extremely sensitive patients will become violently 
ill upon administration of nicotine. Therefore the 
15 present invention provides a method of identifying 

nicotine-sensitive patients by genotyping a patient's 
CYP2A6 allele. The present invention also provides a 
convenient method for determining an individual's general 
predisposition to using tobacco based upon their 
2^ sensitivity to nicotine. 

In addition, CYP2A6 is involved in activating 
nitrosamines, thereby producing the potent carcinogen NNK. 
Increased levels of NNK have been associated with a 
variety of cancers, including but not limited to lung 
25 cancer, nasal -pharynx cancers, throat cancers and colon 
cancers. In general, elevated levels of CYP2A6 has been 
associated with cancers associated with exposure to 
nitrosamines. The present invention may detect a 
patient's predisposition to such cancers. The presence of 
30 a CYP2A6 gene or a variant thereof will affect the 

likelihood that procarcinogens present in tobacco smoke 
will be activated into carcinogenic nitrosamines and 
nitrosamine-derivatives and therefore result in the 
development of a cancer. 
35 It follows that genetic polymorphisms or 

mutations in either of the two aforementioned genes can 
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lead to an impairment in metabolism of at least the 
aforementioned drugs and chemical carcinogens, 
The present invention relates to the 
identification of the absence or presence of mutations in 
CYP2C9 and CYP2A6 and thus predict the phenotype of an 
individual and so predict whether and how an individual is 
likely to metabolize particular drugs and chemical 
carcinogens. For instance, the R144C mutation arising ^ 
from a C^^j^^T base substitution in the cyP2C9 gene results 
in a reduction in warfarin metabolism. This implies that 
patients with this mutation receiving warfarin require a 
lower dose to maintain an anticoagulation target than 
those patients who do not have the mutation and are also 
receiving warfarin. Conversely, homozygous wild-types 
require higher doses in order to maintain an 
anticoagulation target. 

"Mutation", as the term is used herein denotes 
an allelic variation of a known sequence, which alters the 
expressed gene product's activity. Such a variation need 
not completely inactivate the gene product's activity but 
merely alter it. 

Similarly, one mutation within CYP2A6vl arising 
from a T^Bg-^A base change results in substitution of 
Leucine 160 by Histidine. Another CYP2A6 variant, 
CYP2A6V2, has been identified which differs from CYP2A6 in 
the regions of exons 3, 6 and 8. One particular mutation 
in CYP2A6V2, T^^j-'A mutation is useful in the assay of the 
present invention. These substitutions are very useful in 
detecting predispositions to cancers associated with 
tobacco and activation of nitrosamines. The -normal CYP2A6 
30 enzyme functions in the metabolism of nicotine, one of the 
carcinogenic compounds in tobacco. 

In addition, the present, invention relates to 
the identification of a new variant of CYP2A6 designated 
CYP2A6V2. The variations of CYP2A6v2 from CYP2A6 bear 
35 sequence relatedness with the corresponding exons of the 

CYP2A7 gene, suggesting a recent gene conversion. The cDNA 
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and genomic sequence for this gene is provided in the 
present invention. Hence, at least three different 
allelic variants of cyP2A6 exist and are illustrated in 
Figure 8. These allelic variants include CYP2A6, CYP2A6V1 
and CYP2A6V2. 

Further, the present invention relates to a new 
CYP2A gene, designated CYP2A13 . This gene produces an 
inactive form of CYP2A6, however variants at particular 
positions, including amino acid positions 117, 209 and 365 
produce an enzyme which may alter the enzyme's activity 
and hence affect drug sensitivity- These mutations in 
CYP2A6 are likely to result in a deficiency or impaired 
activity of one of the enzymes responsible, for example, 
for metabolizing drugs, nicotine and nitrosamines. 

CYP2A13 is considered a new cytochrome P450 
gene. However, since the CYP2A13 gene product has a 
similar function as the CYP2A6, it is discussed herein as 
a variant of CYP2A6. That is, assays using the specific 
mutated amino acid positions 117, 209 and 365 of CYP2A13 
and detecting variations at those positions are indicative 
of CYP2A6-like variant functions. 

In one embodiment, the CYP2A6v2 or CYP2A13 
proteins or functional portions thereof are expressed as 
recombinant genes in a cell, so that the cells may be 
transplanted into an individual in need of gene therapy 
due to the predisposition to a carcinogen-associated 
cancer or a sensitivity to a drug. To provide gene 
therapy to an individual, a genetic sequence which encodes 
for all or part of the CYP2A6V2 or CYP2A13 ligands are 
inserted into vectors and introduced into host cells. 
30 Examples of vectors that may be used in gene therapy 

include, but are not limited to, defective retroviral, 
adenoviral, or other viral vectors (see, e. g. , Mulligan, 
R.C., 1993, Science , 260:926-932). The means by which the 
vector carrying the gene may be introduced into the cell 
35 includes, but is not limited to, microinjection, 

electroporation, transduction, or transfection using DEAE- 



20 



25 



wo 95/34679 



PCT/US95/07605 



- 2.4 - 

o 

dextran, lipofection, calcium phosphate or other 
procedures known to the skilled routineer (see, e.g., 
Sambrook et. al. (Eds.), 1989, In "Molecular Cloning. A 
Laboratory Manual", Cold Spring Harbor Press, Plainview, 
New York) . Examples of cells into which the vector 
^ carrying the gene may be introduced include, but are not 
limited to, continuous culture cells, such as COS, 
NIH/3T3, and primary or culture cells of the relevant 
tissue type. 

More specifically, there is provided a method of 
enhancing the therapeutic effects of blood cells, that are 
infused in a patient, comprising: (i) inserting into the 
blood cells of a patient a DNA (RNA) segment encoding 
CYP2A6V2 or CYP2A13 gene product that enhances the 
therapeutic effects of the blood cells; and (ii) 
introducing cells resulting from step (i) into the patient 
under conditions such that the cells resulting from step 
(i) "target" to a tissue site. In the alternative, as 
previously described the cells are not "targeted" and 
functions as a systemic therapeutic. The genes are 

20 inserted in such a manner that the patient's transformed 
blood cell will produce the agent in the patient's body. 
In the case of antigen-specific blood cells which are 
specific for an antigen present at the tissue site, the 
specificity of the blood cells for the antigen is not lost 

25 when the cell produces the product. 

Alternatively, as hereinabove indicated, 
CYP2A6V2 or CYP2A13 DNA (RNA) may be inserted into the 
blood cells of a patient, in vivo, by administering such 
DNA (RNA) in a vehicle which targets such blood cells. 

30 Further details regarding methods of gene 

therapy are provided in Anderson et al., U.S. Patent No. 
5,399,343 which is herewith incorporated herein by 
reference. 

In another embodiment, antisense CYP2A6v2 or 
35 CYP2A13 DNA or RNA may be used to control the expression 
of CYP2 gene. For example, antisense therapy may be used 
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to control CYP2A6's ability to activate dangerous 
nitrosamines by curbing its expression. Methods of 
producing such antisense molecules are described in U.S. 
Patent No. 5,190,931, which is incorporated herein by 
reference. 

Developing a genotyping assay, which could 
distinguish the CYP2 genes of interest from other 
cytochrome P4 50 genes required careful engineering since 
these genes have a high degree of sequence homology. To 
overcome this problem, one embodiment of the present 
invention has elucidated the genomic sequence structure of 
CYP2C9 and CYP2A6 with a view to making, in part, intron 
specific primers. That is to say primers which, in part, 
hybridize to at least one intron, preferably an intron 
adjacent to an exon including the mutation of interest, in 
the gene to be examined. since there is less homology 
between the introns of cytochrome P450 genes, it has been 
found that using intron specific primers, gene specific 
assay can be undertaken. The present invention has a 
further advantage of using intron specific primers in so 
20 far as the use of such primers facilitates the manufacture 
of an optimum length of DNA which in turn facilitates the 
specificity of the instant bioassay. 

A "genotyping" assay as the term is used herein 
refers to any diagnostic or predictive test to detect the 
25 presence or absence of allelic variants of a known gene 

sequence at a specified gene locus. Two gene loci are of 
particular interest in the present invention, CYP2A6 and 
CYP2C9. 

Further, the present invention relates to 
30 differences between the genomic DNA sequence structure and 
the cDNA sequence structure, as illustrated in Figure 7. 
As a result, primers directed at the genomic sequence 
structure have been developed which are more reliable. 

Several methods are provided for identifying the 
35 presence or absence of a mutation at codon 144 of the 

coding sequence of CYP2C9, or alternatively, at codon 160 
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of the coding sequence of CYP2A6, or alternatively, a gene 
conversion event involving cyP2A6 and CYP2A7 in exons 3, 6 
or 8 comprising a DNA encompassing the region of a CYP2 
gene uniqpae to that variant. 

One such method relates to an assay which 
contemplates the use of one specific primer which 
specifically encompasses the region containing the 
mutation, and a second primer which is complementary to 
another portion of the gene. The second primer sequence 
chosen is based upon the cyP2A6, cyP2C9 or CYP2A13 
sequences as set forth in figures 12, 1 and 14, 
respectively, depending upon the preferred size of the 
amplification product. One skilled in the art will know 
how to select second primer based on the region of gene 
chosen for amplification. These primers need not be 
identical to a given sequence but must be sufficiently 
complementaary to hybridize to the target region in a 
specific manner. In short, the primers are preferably at 
least substantially homologous to the nucleic acid 
sequence provided. 
20 Nucleic acid sequences includes, but is not 

limited to, DNA, RNA or cDNA. Nucleic acid sequence as 
used herein refers to an isolated nucleic acid sequence. 
Substantially homologous as used herein refers to 
substantial correspondence between the nucleic acid primer 
25 sequence of as described herein and that of any other 
nucleic acid sequence. Substantially homologous means 
about 50-100% homologous homology, preferably by about 70- 
100% homology, and most preferably about 90-100% homology 
between the particular sequence discussed and that of any 
30 other nucleic acid secjuence. 

In the instant application, the term "primer" is 
further used to designate a molecule comprising at least 
three nucleotides, the exact length being determined by 
the requisite amount of DNA needed, under given reaction 
35 conditions, to bind to or interact with a test sample so 
as to identify the presence or absence of either of said 
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mutations. Preferably, the primer is usually between 15 
and ideally about 20 to 50 oligonucleotides in length. 

The primer is selected, or adapted, to be 
substantially complementary to a part of DNA which is 
adjacent to the region of at least one of the 
^ aforementioned mutations. Thus such a primer is able is 
hybridize with a part of DNA that contains a region in 
which the mutation of interest may be found. Although the 
primer may not reflect the exact sequence of the region in 
which the mutation is thought to occur, the more closely 
^® the primer is to this sequence, then the better the 

binding will be. Ideally, the more closely the sequence 
of the 3 ' end of the primer is to said region the better 
the binding or interaction will be. 

An alternative method for using the sequence 
15 unique to a variant for detection relates to use of an 
oligonucleotide probe for specifically detecting the 
presence or absence of a CYP2 variant gene in a sample, 
this methodi comprises the steps of contacting the sample 
with a nucleic acid probe, allowing hybridization, forming 
20 a probe: CYP2 variant complex; washing excess probe from 
probe: CYP2 variant complex; and detecting probe: CYP2 
variant complex, wherein a positive signal is an 
indication of the presence of the CYP2 variant in the 
sample. 

25 The hybridization of the probe to sample nucleic 

acids can be carried out by any of the methods commonly 
used in the art. Such methods include but are not limited 
to. Dot blot. Colony hybridization. Southern blot, 
solution hybridization and in situ hybridization. 

30 Washing the excess probe from the probe: CYP2 

variant DNA can be accomplished by many well-known 
methods. Simply rinsing the complex with excess buffer 
will facilitate removal of excess probe. Alternatively, 
washing may entail separating the probe: CYP2 variant 

35 complex from excess probe. Many methods are known to one 
skilled in the art and include but are not limited to 
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centrifugation, filtration and magnetic force. 

According to the present invention there is ^ 
provided a portion of DNA suitable for use as a primer in 
a method for identifying the presence or absence of a 
mutation either at codon 144 of the coding sequence of the 
gene CYP2C9, or alternatively, at least one gene 
conversion event involving CYP2A6 and CYP2A7 in exons 3, 6 
or 8, or alternatively, at codon 160 of the coding _ 
sequence of the gene CYP2A6 ; comprising a DNA which is 
adapted to hybridize to at least one intron of at least 
one of said genes. 

In one embodiment, the method comprises the use 
of at least one restriction endonuclease to digest DNA 
from individuals to be tested. In this instance, DNA from 
individuals positive for the wild-type form of CYP2C9 
provide a digest with a restriction endonuclease, such as 
Avail results in production of two fragments, a first 
fragment including 270 base pairs and a second fragment 
including 50 base pairs. In contrast, individuals having 
the aforementioned mutation in CYP2C9 present a single 
20 fragment of 320 base pairs only. This is due. to a loss of 
the Avail site. The CYP2A6 gene variants can also be 
distinguished by the occurrence of specific restriction 
endonuclease sites. The CYP2A6vl variant, which is a 
T^gg-+A mutation in exon 3 can be identified by a variant- 
25 specific XcmL restriction site. The CYP2A6v2 variant, 
which contains a C^^j-^A mutation within exon 3 can be 
identified by a variant-specific i?del restriction site. 
The wild-type CYP2A6 gene does not contain either an Xcml, 
or Ddel site. The results of such restriction 
30 endonuclease digestions are illustrated in Figure 9. 

It may be necessary to amplify the DNA prior to 
digestion. Such may be the case when the DNA of interest 
is present in minute quantities in a sample. In such 
circumstances, amplification of DNA to be tested is 
35 undertaken before digesting the DNA as described above. 
This provides for a greater quantity of materials. 
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Amplification is performed using any conventional 
technique, such as by a PGR reaction. Many other 
techniques for amplification can be used in producing 
sufficient DNA for detections. Such amplification 
techniques are well-known to the skilled artisan and 
include, but are not limited to polymerase chain reaction 
(PGR), PGR in situ, ligase amplification reaction (LAR) , 
ligase hybridization, QB bacteriophage replicase, 
transcription-based amplification system (TAS) , genomic 
amplification with transcript sequencing (GAWTS) and 
nucleic acid sequence-based amplification (NASBA) • A 
general review of these methods is available in Landegren, 
et al.. Science 242:229-237 (1988) and Lewis, R. , Genetic 
Engineering News 10:1, 54-55 (1990), which is incorporated 
herein by reference. 

One embodiment of the present invention uses 
oligonucleotide primers in an amplification and detection 
assay. A basic description of nucleic acid amplification 
is described in Mullis, U.S. Patent No. 4,683,202, which 
is incorporated herein by reference. The amplification 
reaction uses a template nucleic acid contained in a 
sample, two primer sequences and inducing agents. The 
extension product of one primer when hybridized to the 
second primer becomes a template for the production of a 
complementary extension product and vice versa, and the 
25 process is repeated as often as is necessary to produce a 
detectable amount of the sequence. 

The inducing agent may be any compound or system 
which will function to accomplish the synthesis of primer 
extension products, including enzymes. Suitable enzymes 
30 for this purpose include, for example, E*coli DNA 

polymerase I, thermostable Taq DNA polymerase, Klenow 
fragment of E.coli DNA polymerase I, T4 DNA polymerase, 
other available DNA polymerases, reverse transcriptase and 
other enzymes which will facilitate combination of the 
35 nucleotides in the proper manner to form amplification 

products . 
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A sample being screened for the presence or 
absence of a mutation in CYP2A6 and/or CYP2C9 genes can be 
tested with the instant invention. The nucleic acid 
material can be in purified or nonpurified form, provided 
the sample contains the CYP2A6 and/or CYP2C9 genes. The 
^ sample may be derived from any tissue or bodily fluid, 
wherein the patient's DNA can be found. A clinically 
practical type of sample is a blood specimen which 
contains patient DNA and can conveniently be genotyped in 
the bioassay of the present invention. 
10 The "primers", as the term is used in the 

present invention refers to an oligonucleotide, whether 
occurring naturally as in a purified restriction digest or 
produced synthetically, which is capable of acting as a 
point of initiation of synthesis when placed under 
15 conditions wherein synthesis of a primer extension product 
which is complementary to a nucleic acid strand is 
induced, i.e. in the presence of nucleotides and an 
inducing agent such as DNA polymerase and at a suitable 
temperature and pH. The primers are preferably single 
20 stranded for maximum efficiency in amplification, but may 
alternatively be double stranded. If double stranded, the 
primer is first treated to separate its strands before 
being used to prepare amplification products. Preferably, 
the primers are oligodeoxyribonucleotides. The primers 
25 must be sufficiently long to prime the synthesis of 

extension products in the presence of the inducing agent. 
The exact lengths of the primers will depend on many 
factors, including temperature, source of primer and use 
of the method. For diagnostic methods, the primers 
30 typically contain at least 10 or more nucleotides. The 

oligonucleotide primers may be prepared using any suitable 
method, such as, for example, the phosphotriester and 
phosphodiester methods (Narang, S.A., et al., Meth. 
Enzymol. 68:90 (1979); Brown E.L. , et al., WetJi. Enzymol, , 
68:109 (1979)) or automated embodiments thereof. In one 
such automated embodiment diethylphosphoramidites are used 
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as starting materials and may be synthesized as described 
by Beaucage et al. , Tetrahedron Letters 22:1859-1962 
(1981). One method for synthesizing oligonucleotides on a 
modified solid support is described in U.S. Pat. No. 
4,458,066. It is also possible to use a primer which has 

^ been isolated from a biological source (such as a 
restriction endonuclease digest) . 

In a genotyping bioassay of the present 
invention, one embodiment comprises a gene-specific 
amplification reaction/ an exon-specif ic amplification 
reaction and a restriction endonuclease reaction. In such 
a reaction a suitable polynucleotide polymerase is used in 
the amplification reaction, many of which have already 
been described in the art. In addition, any appropriate 
restriction endonuclease which is designed to digest tbe 

15 DNA and so provide information concerning genotype may be 
used. 

It may further be necessary to provide a label 
on the nucleic acid for detection. The nucleic acid can 
be DNA or RNA and made detectable by any of the many 
20 labeling techniques readily available and known to the 
skilled artisan. Such methods include, but are not 
limited to, radio-labelling, digoxygenin-labeling, and 
biotin-labeling. A well-known method of labeling DNA is 
^^P using DNA polymerase, Klenow enzyme or polynucleotide 
25 kinase. In addition, there are known non-radioactive 

techniques for signal amplification including methods for 
attaching chemical moieties to pyrimidine and purine rings 
(Dale, R.N.K. et al . 1973 Proc. Natl. Acad. Sci. USA, 
70:2238-2242; Heck, R.F. 1968 S. Am. Chem. Soc. , 90:5518- 
30 5523) , methods which allow detection by chemiluminescence 
(Barton, S.K. et al. 1992 J". Am. Chem. Soc, 114:8736- 
8740) and methods utilizing biotinylated nucleic acid 
probes (Johnson, T.K. et al. 1983 Anal. Bxochem. , 133:125- 
131; Erickson, P.F. et al. 1982 J. of Immunology Methods, 
35 51:241-249; Matthaei, F.S. et al 1986 Anal. Blochem. , 

157:123-128) and methods which allow detection by 



wo 95/34679 



PCT/US95yO7605- 



- 22 - 

fluorescence using commercially available products. Non- 
radioactive labelling kits are also commercially 
available. Such a label can readily be incorporated into 
the nucleic acid during an amplification step. In the 
absence of an amplification step, a target nucleic acid 
can readily be chemically or enzymatically modified to 
carry a label. Additionally, it may be preferable to 
provide a labeled primer which may serve to incorporate a 
label into the nucleic acid target. Probes, as may be 
used in an embodiment of the invention may also be 
chemically or enzymatically labeled as described above. 

In a preferred embodiment of the invention said 
DNA primer hybridizes to an intron adjacent said position 
of said mutation. Preferably said DNA is a primer with 
the 3 '-end specific for the gene of interest. Preferably 
further still said DNA is single stranded. Preferably 
further still, in so far as the CYP2C9 mutation is 
concerned, said primers are as follows: 

HF18: position 8 of intron 2 onwards of genomic 
sequence in foarward orientation comprises 
5' TGCAAGTGCCTGTTTCAGCA 3' 

HF2R: position 505 onwards of cDNA sequence in 
reverse orientation comprises 
5 ' AGCCTTGGTTTTTCTCAACTC 3 ' . 

It is of note that both these primers are 
designed to be specific for cyP2C9 and so do not amplify 
related genes such as CYP2C8, which notably also has an 
Arginine,^^ present. 

Preferably, in so far as CYP2A6 is concerned, 
three primers J51, J61 and B are used in two parallel 
allele-specif ic PGR reactions. These primers are as 
follows : 

35 J51 comprises 5' GGCTTCCTCATCGACGCACT 3' 

(forward strand from position 479 of cDNA 
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sequence described as hIIA3 (Yamano, et al. 1990 
Biochejn 29:1322-29) ) . ^ 
J61 comprises 5' GGCTTCCTCATCGACGCACA 3' 
(forward strand from position 479 of cDNA 
sequence described as hIIA3v (Yamano, et al. 

^ 1990 Bxoch&m 29 : 1322-29) ) • 

Both J51 and J61 contain a substitution at 
position 18 of A for C to give improved 
specificity as suggested by Newton et al (1989 
Nuc. Acxds Rbs. 17:2503-2516). 

10 Primer B comprises 5' AATTCCAGGAGGCAGGGCCT 3' 

(reverse orientation from position 12 5 of intron 
3 of CYP2A6 (onwards) . Designed so that only 
CYP2A6 and not CYP2A7 or CYP2A12 are amplified. 

15 One method of genotyping cyP2A6 provides an 

allele-specif ic amplification reaction method is used. In 
this instance, DNA which is adapted to specifically 
hybridize to the wild-type or the mutant type of the gene 
is incubated with test DNA under reaction conditions and 

^0 the resultant products are analyzed by electrophoresis and 
then visualized by staining with ethidium bromide. 
Individuals who are homozygous for the wild-type allele 
produce a reaction product with primer J51 only. 
Similarly, individuals who are homozygous for the mutation 

25 produce a reaction product with primer J61 only. Those 
individuals who are heterozygous produce a reaction 
product with both J51 and J61. 

Alternatively, another method for genotyping 
CYP2A6 is provided in a specific amplification bioassay, 

30 which is achieved with primers F4 and R4 as follows: 

The F4 primer (forward) comprises 

5 ' CCCCTTATCCTCCCTTGCTGGCTGTGTCCCAAGCTAGGCAGGATT 
CATGGTGGGGCA 3 ' , wherein a preferred fragment 
35 thereof further comprises 

5 ' CCTCCCTTGCTGGCTGTGTCCCAAGCTAGGC 3 ' , 
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The R4 primer (reverse) comprises 

5 ' GCCACCACGCCCCTTCCTTTCCGCCATCCTGCCCCCAGTCTTAGC 
TGCGCCCCTCTC 3' , wherein a preferred fragment 
thereof further comprises 
5 ' CGCCCCTTCCTTTCCGCCATCCTGCCCCCAG 3 ' • 

This method of CYP2A6 genotyping involves a 
first amplification reaction with F4 and R4 primers, which 
generates a DNA fragment approximately 7.8 Jcb in size. 
This amplification step is facilitated by polymerases 
which are capable of transcribing long stretches of DNA. 
To distinguish the CYP26Avl and cyP26Av2 variant alleles, 
an exon-specif ic amplification step is carried out using 
the 7.8 Kb DNA fragment as template DNA. This may be 
accomplished using the following primer pair: 



The E3F primer (forward) comprises 

5 ' CCTGATCGACTAGGCGTGGTATTCAGCAACGGGGAGCGCGCCAAG 
CAGCTCCTG 3', wherein a preferred fragment 
thereof further comprises 
20 5 / GCGTGGTATTCAGCAACGGG 3 ' . 

The E3R primer (reverse) comprises 

5 ' CGCGCGGGTTCCTCGTCCTGGGTGTTTTCCTTCTCCTGCCCCCGC 
ACTCGGGATGCG 3', wherein a preferred fragment 
thereof further comprises 
25 5 / TCGTCCTGGGTGTTTTCCTTC 3 ' . 

Using these primers in a second amplification 
reaction step a segment of CYP2A6 exon 3 is specifically 
amplified. The method further comprises use of the 

30 restriction endonuclease XcmJ. to detect the CYP2A6vl 
mutation and Ddel to detect the CYP2A6v2 mutation. 

According to a yet further aspect of the 
invention there is provided a kit for performing the afore 
described methods which kit includes at least a portion of 

35 DNA in accordance with the invention and preferably at 

least one control sample of DNA containing the mutation or 
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mutations of interest and ideally also a wild-type sample 
of DNA so that suitable comparisons can be made. 

It is of note that although the method is 
described with reference to the above methods, any 
suitable method using the genetic material of the 
invention may be used to identify the mutations described 
herein. 

The CYP2C9 assay has been used in a study of 
warfarin dose requirement in 94 patierfts undergoing 
anticoagulant treatment and the results obtained are 
suxtimarized in Figure 5. 58 patients (61.7%) were 
homozygous for the wild-type (Arg^^J allele and were found 
to require a median weekly maintenance dose of 31.5 mg of 
warfarin. 36 patients (38.6%) were heterozygous and 
required a median weekly maintenance dose of 24.5 mg. The 
doses required by the two groups were significantly 
different (Mann-Whitney U-test, p = 0.016). No subjects 
in the group were homozygous for the mutant allele but 
based on allele frequencies and the Hardy Weinberg 
equilibrium, the predicted frequency of homozygous mutant 
sub j ects is 3.7.%. 

Comparison of the weekly maintenance dose of 
warfarin in the R144C heterozygotes (n = 36) and 
homozygous wild-type (n = 58) reveals that the 
heterozygotes required a significantly lower dose (range 
25 of 10.5 - 80. mg). Moreover, of the patients requiring the 
lowest doses to maintain an anticoagulation target (INR 
2.0-4.0), in the range 5-15 mg per week, 9 out of 10 were 
heterozygous. At the other extreme of weekly doses >55 
mg, 5 out of 6 patients were homozygous wild-type for 
30 CYP2C9. The significantly lower (20%) warfarin dose 

requirement of the patients with one variant R144C allele 
is consistent with the kinetic properties of the R144C 
protein with respect to (S) -warfarin hydroxylation and 
presumed in vivo metabolic clearance (Rettie et al. 1994 
35 Pharmocogen. , 4:39-42). 

The CYP2A6 genotyping assay has been used in 
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studies on coumarin metabolism. Coumarin 7-hydroxylase 
activity is a convenient marker activity to identify the 
presence of CYP2A6 in a particular sample. There is 
considerable variation in the ability of individuals to 
7-hydroxylate this compound which is a reaction specific 
^ for CYP2A6. A subject deficient in coumarin 

7-hydroxylation has been identified. This subject is 
homozygous for the mutant cyP2A6vl allele confirming the 
previous in vitro findings that sxibstitution of LeuI^O by 
His results in loss of coumarin 7-hydroxylase activity. 
As shown in Fig. 6, cyP2A6 genotyping and phenotyping with 
coumarin has been performed on other members of the 
proband's family and impaired coumarin 7-hydroxylation has 
been observed in hetero zygotes for the CYP2A6vl mutation. 

The genotyping assays described herein resulted 
from a two step amplification reaction wherein first 
amplification reaction amplifies a 7.8 Kb fragment 
containing the cyP2A6 gene (Pig. 9A) and a second 
amplification reaction amplifies an exon-specific fragment 
of CYP2A6. The amplification product was digested with 
restriction endonucleases producing different patterns for 
the various CYP2A6 alleles. Representative results 
obtained for several human subjects for the detection of 
the CYP2A6V1 (Xcml digestion) and CYP2A6v2 (Ddel 
digestion) are shown in Figure 9 panel B. A schematic 
2^ depiction of this genotyping assay is shown in Figure 9, 
panel C. Of 155 human genomic DNA samples analyzed 21 
heterozygous (+/-) and 6 homozygous (-/-) subjects were 
detected for the CYP2A6vl allele, whereas 17 heterozygous 
(4-/-) and no homozygous were identified for the CYP2A6v2 
30 allele variant. Additionally, 7 homozygous for both 
CPYP2A6V1 and CYP2A6v2 alleles were found. 

Allelic frequencies were calculated for either 
allele in several ethic groups and analyzed as shown in 
Table 1. CYP2A6V1 frequency is almost identical between 
35 Caucasian and Japanese, and it is only twice the frequency 

in Taiwanese samples. Significantly, this allele is 
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completely absent in the African-American population 
within the samples studied. The Japanese population has a 
remarkable higher frequency for the CYP2A6v2 allele (28%) 
as compared to the Caucasian (2%) , Taiwanese (6%) or 
African-American (2.5%) (ethnic groups) . 

Table 1 ; Allelic frequency for the CYP2A6 gene in 
different ethnic groups. 

Allelic Frequencies (%) 



Ethnic Group 


CYP2A6 


CYP2A6V1 


CYP2A6V2 


N 


Caucasian 


75 


23 


2 


52 


Japanese 


52 


20 


28 


40 


Taiwanese 


83 


11 


6 


178 


African-American 


97.5 


0 


2.5 


40 



The following examples illustrate various 
aspects of the present invention and in no way are 
intended to limit the scope thereof. All books, articles, 
and patents referenced herein are incorporated herein, in 
toto, by reference. Other similar embodiments will be 
clear to the skilled artisan and are encompassed within 
the spirit and purview of the present invention. 



EXAMPLE 1 

Method for determining the geno tvpe CYP2C9 

Genotyping for the CYP2C9 polymorphism is 
carried out by amplification by PGR followed by digestion 
with the restriction endonuclease Avail. Amplifications 
are performed in 0.5 ml microcentrifuge tubes in a volume 
of 100 \l\ containing 10 mM Tris-HCl, pH 8.8, 1.5 mM MgC12. 
50 mM KCl, 0.1% Triton X-100, 5% dimethylsulphoxide, 200 
;iM each of dTTP, dATP, dCTP and dGTP, 250 a^M of the 
primers HF18 and HF2R, 2.5 units Tag polymerase and 1 /xg 
human leukocyte genomic DNA. PGR conditions consist of 35 
cycles with a denaturation at 93 "G for 1 min. annealing at 
55 "G for 1.5 min and polymerization at 72 'G for I min. 20 
35 ^1 of the amplified DNA is incubated with 10 units Avail 
for 3h at 37 'G and then analyzed by electrophoresis on 
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1.8% agarose lainigels in TBE (90 mM Tris-borate, 2 mM 
EDTA) buffer. The digestion products are visualized by 
ethidium bromide staining. DNA from individuals positive 
for the wild-type Arg,44 is digested to give fragments of 
270 bp and 50 bp whereas in individuals with the mutant 
Cys,4^ present, a band of 320 bp is seen due to loss of an 
Avail site (Figure 3). 

EXAMPLE 2 

Genotyping for the cyP2C9 polymorphism was 
carried out by amplification by PGR followed by digestion 
with the restriction endonuclease Avail. 

One hundred patients were recruited from two 
anticoagulation clinics in the Newcastle area over four 
study days. Body weight and height were measured,, the 
basal metabolic index ("EMI") calculated for each patient 
and details of age, sex, drug history, current and 
previous International Normalized Ratio ("INR") 
determinations, indications for anticoagulation and other 
significant health problems were all recorded. DNA was 
isolated by a standard manual chloroform-phenol extraction 
procedure and 1/xg was subjected to PGR analysis. As shown 
in Figure 10 the C-^T substitution, which converts Arg-^144 
to Cys, resides in exon 3 of the CYP2C9 gene and results 
in the loss of an Avail restriction site 

25 f G AGGACC GTGTTCAA. . . ) in the R144C allele 

( . . •GAGGACTGTGTTCAA. . . ) . This provided the basis of the 
amplification strategy. A CYP2C9 specific intron forward 
primer (HF18, TGCAAGTGCCTGTTTCAGCA , Figure 10) and a 
CYP2C9 exon 3 3 '-end reverse primer (HF2R, 
30 AGCCTTGGTTTTTCTCAACTC , Figure 10) were used at a 
concentration of 250mM each. Amplifications were 
performed in a volume of 100 /zl containing 20 mM Tris HCl 
(pH 8.3), 1.5 mM MgCl2, 25 mM KCl, 0.05% (w/v) Tween 20, 
10 /ig gelatin/ml, 2% (w/v) DMSO, 200 fiK each of dATP, 
dCTP, dGTP and dTTP and 2.5 units of Tag DNA polymerase 
(Perkin-Elmer) . Reactions were carried out for 3 5 cycles 



20 



35 



wo 95/34679 



PCTAJS95/07605 



- 29 - 

o 

at an annealing temperature of 55 "C for 90 sec, a 

polymerase temperature of 72* C for 1 min, and a heat ^ 

denaturing temperature of 93 'C for 1 min, using a Perkin- 
Elmer Cetus DNA thermal cycler. The PGR products digested 
with Avail and sized using NuSieve agarose gels (3% 
^ NuSieve, 0^75% agarose). Presence of the CYP2C9 wild-type 
and R14 4C alleles were detected as fragments of 50 + 270 
bp and 320 bp respectively (see Figures 3) . The PGR _ 
product synthesized from human genomic DNA with the 
primers HF18/HF2R was directly sequenced on an ABI 373A 
automatic sequencer. Briefly, the PGR product was first 
purified by using the Wizard DNA clean-up system (Promega 
Co., Madison, WI) . The purified template was then 
subjected to dideoxy terminator cycle-sequencing with the 
primers HF18 and HF2R. The primer-extended products were 
purified and sequenced following the manufacturer's 
procedure. Sequence analysis was done by using the 
MacVector software program (Eastman-Kodak Co. , Rochester, 
NY) . 

DNA was obtained from 94 patients. Of these 58 
(62%) were homozygous for the wild-type GyP2C9 gene and 36 
(38%) were heterozygous for the R144C allele. No R144C 
homozygotes were found. The frequency of the wild-type 
(Arg-144) and R144C (Gys-144) alleles in the study 
population is thus 0.808 and 0.192 respectively. An 

25 expectation of 3.7% R144C homozygotes can be anticipated 
from the Hardy-Weinberg equilibrium, but the 95% 
confidence interval in this estimation of 0.8-8.4% and 
thus the finding of zero homozygotes in 94 patients is not 
significantly different from expectation. The specificity 

30 of the PGR reaction with respect to the GYP2G9 gene was 
confirmed by sequencing. The alignment of the sequence 
obtained from the PGR product with that corresponding to 
the GYP2G9 gene showed a 100% degree of homology. 
Interestingly, a heterozygous pattern was obtained for the 

35 R144C allelic variant, confirming the high frequency of 
this allele within the normal population. No sequence 
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deriving from cyP2C9, CYP2C18 or CYP2C19 was found 
confirming the specificity of the assay for CYP2C9. 



EXAMPLE 3 

Method for determining the aenotvpe CYP2A6 

Genotyping for the CYP2A6 polymorphism is 
carried out by allele-specif ic PGR using two parallel PGR 
reactions, one specific for the wild-type allele, one for 
the mutant allele • Amplifications are performed in 0,5 ml 
microcentrifuge tubes in a volume of 45 ^1 containing 10 
mM Tris-HCl, pH 8.8, 1.5 mM MgClg, 50 mM KCl, 0.1 % Triton 
X-100, 5 % dimethylsulf oxide, 200 ftM each of dTTP, dATP, 
dCTP and dGTP, 250 /iM of the primers B and either J51 or 
J61, 1.2 5 units Tag polymerase and 1 ftg human leukocyte 
genomic DNA. PGR conditions consist of 40 cycles with a 
denaturation at 93 *C for 1 min. , annealing at 57 "C for 2 
min and polymerization at 70 for 2 min. The products 
are analyzed by electrophoresis on 1% agarose minigels in 
TBE buffer and DNA is visualized by staining with ethidium 
bromide. As shown in Figure 4, there are three possible 
results: the individual may be homozygous for the 
wild-type allele and give a DNA product only for the PGR 
reaction with primer J51, the individual may be 
heterozygous with one wild-type and one mutant allele and 
give DNA products with both primers J51 and J61 or the 
individual may be homozygous for the mutation and give a 
DNA product only with the J 61 primer. 



EXAMPLE 4 

Alternative Method for Determining the Genotype CYP2A6 

For use of F4 and R4 primers, each reaction 
mixture contained 600 ng human genomic DNA, 0.2 /iM of each 
primer, 2 00 dNTP's, 0.8 mM magnesium acetate and 2 
units of rTth I DNA polymerase. Hot start was as 
indicated by the manufacturer (Perkin Elmer) and the 
amplification reaction of 31 cycles of 93 'G, 1 min; 66'G, 
6 min 30 sec. Amplification products were analyzed in 
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0.7% agarose gels and the DNA visualized by staining with 
ethidivim bromide. For the exon 3 specific amplification, 
the reaction which uses, the primers E3F and E3R consist 
of 5^1 of the 7.8 Kb PGR reaction, 0.5 fiM of each primer, 
200 nVl dNTP's, 1.5 mM MgClj and 2.5 units of Tag DNA 
^ polymerase. The amplification reaction consisted of 94 'C 
for 3 minutes followed by 31 cycles of 94 'C, 1 minute; 
60* C, 1 minute and 72 'C, 1 minute. 

Amplification products were then digested 
without purification with restriction endonucleases which 
detect the CYP2A6 wild type (no digestion) , CYP2A6vl 
(Xcml) and CYP2A6v2 (Udel) . DNA was visualized by use of 
ethidium bromide after electrophoresis in 1% agarose, 3% 

NuSieve agarose. 

it is of note that CYP2C9 genotyping can be 
15 performed using an allele-specific assay similar to that 
used above for CYP2A6. 
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CLAIMS 

1. A CYP2A6V2 DNA having a coding sequence 
shown in Figure 11. 

2. The DNA of claim 1 having a genomic 
sequence as shown in Figure 12. 

3. A CYP2A13 DNA having a coding sequence 
shown in Figure 13. 

4. The DNA of claim 3 having a genomic 
sequence shown in Figure 14 . 

5. A nucleic acid primer sequence comprising 
at least ten (10) contiguous nucleotide bases selected 
from the sequence showing in Figure 12. 

6. A nucleic acid primer sequence comprising 
at least ten (10) contiguous nucleotide bases selected 
from the sequence shown in Figure 14. 



7. A nucleic acid primer sequence selected 

from the group consisting of: 

A. 5' GGCTTCCTCATCGACGCACT 3'; 

25 B. 5' GGCTTCCTCATCGACGCACA 3'; 

C. 5' AATTCCAGGAGGCAGGGCCT 3'; 

D. 5' TGCAAGTGCCTGTTTCAGCA 3'; 

E. 5' AGCCTTGGTTTTTCTCAACTC 3'; 

F. 5 ' CCCCTTATCCTCCCTTGCTGGCTGTGTCCCAAGCTAGGCA 
30 GGATTCATGGTGGGGCA 3 ' ; 

G . 5 ' GCCACCACGCCCCTTCCTTTCCGCCATCCTGCCCCCAGTC 

TTAGCTGCGCCCCTCTC 3 ' ; 

H . 5' CCTGATCGACTAGGCGTGGTATTCAGCAACGGGGAGCGCG 
CCAAGCAGCTCCTG 3 ' ; 

35 1.5' CGCGCGGGTTCCTCGTCCTGGGTGTTTTCCTTCTCCTGCC 

CCCGCACTCGGGATGCG 3 ' ; 
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or any nucleic acid sequence of at least 10 contiguous 
nucleotides selected from any one of A-I. 

8. A method of determining the presence or 
absence of an allelic variant in CYP2A6 or CYP2C9 DNA 

^ comprising; 

(a) amplifying an exon containing a variant 
sequence with in said DNA, producing an 
extension product; 

(b) treating extension products with at least 
10 one restriction endonuclease under 

conditions sufficient to produce digestion 
fragments ; 

(c) analyzing the digestion f ragments , f or a 
variant specific digestion fragment or lack 

15 thereof. 

9. The method of claim 8 wherein a CYP2C9 
variant DNA is being detected. 

20 10. The method of claim 9 wherein the 

amplifying step is a polymerase chain reaction using 
primers comprising HF18 and HF2R. 

11. The method of claim 8 wherein step (a) is 
25 preceded by a gene-specific amplification reaction. 

12. The method of claim 11 wherein the gene- 
specific amplification is a polymerase chain reaction. 

30 13 . The method of claim 12 wherein a CYP2A6 

variant is being detected. 

14 . The method of claim 13 wherein a gene- 
specific amplification reaction uses primers comprising F4 
35 and R4 and the exon amplification reaction uses primers 
comprising E3F and E3R. 
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15. The method according to claim 10 wherein 
the extension products are treated with the restriction 
endonuclease Avail. 

16. The method according to claim 14 wherein 
the extension products are treated with at least one 
restriction endonuclease comprising I>del and XcjnI. 

17. A method of determining the presence or 
absence of an allelic variant in cyP2A6 or CYP2C9 DNA 
comprising: 

(a) contacting said DNA with a first primer 
encompassing a nucleotide variation 
specific to variant DNA and a second primer 
which is complementary to a region of said 
DNA such that upon hybridization and 
amplification, an extension product will be 
formed; 

(b) analyzing the extension products for 
allelic-variant specific extension 
products . 



18, The method of claim 17 wherein a CYP2A6 
variant DNA is being detected. 

19. The method of claim 18 wherein the 
amplifying step is a polymerase chain reaction wherein the 
first primer comprises J51 and J61 and the second primer 
comprises primer B. 

30 20. A kit for determining the presence or 

absence of an allelic variant of CYP2A6 or CYP2C9 DNA 
comprising: at least one nucleic acid primer sequence 
capable of hybridizing to said DNA; the kit further 
containing instructions relating to the determination of 

35 the presence or absence of an allelic variant of CYP2A6 or 
CYP2C9 DNA. 
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21. The kit according to claim 2 0 further 
comprising amplification components and at least one 
restriction endonuclease. 

22. The kit of claim 20 wherein the cyP2A6 
allelic variant is being detected. 

I 

23. The kit of claim 22 wherein the nucleic 
acid primers comprise F4, R4 , E3F and E3R. 

24. The kit according to claim 20 wherein the 
CYP2C9 allelic variant is being detected. 

25. The kit according to claim 25 wherein the 
nucleic acid primers comprise HF18 and HF2R. 

26. A process for providing a human with a 
therapeutic CYP2A6v2 or CYP2A13 DNA segment said human 
cells expressing in vivo in said human or therapeutically 
effective amount of said protein. 

27. A pharmaceutical composition comprising an 
antisense nucleic acid derived from CYP2A6v2 DNA. 

28. A pharmaceutical composition comprising and 
antisense nucleic acid derived from CYP2A13. 
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2A6 incron 2 
2A8 incron 2 
2A7 incron 2 

Conaenaua 




42 
49 
41 

SO 



2A6 Incron 2 
2Aa Incron 2 
2A7 Incron 2 

Condensua 



2Afi incron 2 
2AB incron 2 
2A7 incron 2 

Consensus 



ice AGCCTTCrCC CTGACTCTCC TC 
ACCCTTCrCC CTGACrCTCC 




1CIACTGG AG(^7;¥1GCp 

j^crcG AG^;r — 



-GG AGCrX 



TCCCCAT CTCCC3JI 
rCCCCAT CTCCC^I 

:ccAT crccriji 

1 



a: crcco 



— He 




92 
99 
91 

100 



142 
149 
X41 

150 



2A6 incron 2 
2Ae incron 2 
2A7 incron 2 

Consensus 



c:TCTciic|T cc; 



cc J TGT( 



TG TCTCCAGC^ 

TG Tcrc; 




:3 




192 
199 
190 

200 



2A6 incron 2 
2 AS incron 2 
2A7 incron 2 

Consensus 




242 
249 
239 

250 



2Afi incron 2 
2A6 incron 2 
2A7 incron 2 

Consensus 



crccTCTcrc tccj 
c:ccTCTrrc tcc;^ 



cicc 

cUc: 



rrcTC Tc; 




AGCACATCirT 

aggacatc zz 
acgacatc — 



CGGTTTCTGT TTACCACCCC 
GG^ i' r T C~GT TTACCAGCCC 



GGGTTTCTGT TTACCAGCCC 



292 
299 
267 

300 



2A6 incron 2 
2A6 incron 2 
2A7 incron 2 

Consensus 



2A6 incron 2 
2 AS incron 2 
2A7 incron 2 

Consensus 



TGGGTCTCTG TCTACATGAG TCTT TG;»CZ;C 

TGGTcrrcrG tcttcatttc Tcrrrriarc 



GCTCT C GGCT TCTCT 




TcrGXGTTTc TorrcTCTcr GswTcccrrr CYCvrrrcTT cctctctc 



342 
349 
271 

350 



392 
399 
271 

400 



2A6 incron 2 
2A8 incron 2 
2A7 incron 2 

Consensus 



AGGATGCCAG GGTTATTCCT ACTTCCACAT 
AGGATTTCAG GG7-ATTCCT ACTTCCACAT 



CTTCAGGCTC CATCTCCTGG 
CTCCAGCTCC CAACTCCTGG 



AGGATKYCAC GGTTATTCCT ACTTCCACAT CT"!fCAG3*rrC CAWCTCCT 



442 
44B 

271 

450 



2A6 incron 2 

'2A8 incron 2 

2A7 incron 2 

Consensus 



TAACAGTCTC TCTTCCTTCC AGACCCTCT 
TA»vTTG*CT3 TCCTCCTTCC CGATCCTC" 



TAA-r.'JCTCTS TCTTCCTTCC MGA^CCTCTC TGTTTCTRTC TCTlATATTVrw 



4 92 
4 98 
271 

500 



2A6 intrcn 2 
2A8 incron 2 
2A7 incrsn 2 

Consensus 



ACT^.CT— G CTCr.\GrTCA GCTTAACAAT 
.».».w.-.. C. Cwrf^vjTTCA GATTAAGAAT 



CTCACACCAA GACAGCAT3T 



WC.C — .».K CTCZAGTTCA G*^ . ^ AAG- 



AAT CTYWCACCAA GWKWKKATKT 



540 
546 
271 

550 
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2A6 Intron 2 
2AB intron 2 
2A7 incron 2 



CCTCCACCCA GATCTCCCCA TATCTCACTA CCCCACCCTC CATC CTC 

CCTCCTCCCA GATCTCCCCA TATCTCACTT CCCCTCCCTC CATCTCTCTC 



Consensus CCTCCWCCCA GATCTCCCCA TATCTCACTW CCCCWCCCTC CATCTCTCTC 
2A6 incron 2 TGCCT C CATCAC — TC TCTTTCTC TCC CC — A 



2A7 incron 2 
Consensus 



2A8 incron 2 
2A7 incron 2 

Consensus 



2A7 intron 2 
Consensus 

2A6 incron 2 
2A8 incron 2 
2A7 incron 2 

Consensus 

2A6 incron 2 
2AB intron 2 
2A7 incron 2 

Consensus 



2A6 incron 2 
2 AS incron 2 
2A7 intron 2 

Consensus 



2A6 incron 2 
2AB incron 2 
2A7 incron 2 

Consensus 



2 AS incron 2 
2A8 incron 2 
2A7 incron 2 

Consensus 



TTTCTCTCCC 


CACTACCTTC 


CCTTCCTCCA 


TGGAGTATCC 


CCGTATCCC7 


TKTCTCTCCC 


CATTACCTTC 


YCTTTCTCCA 


TGGAGTATCC 


CCGTATCCCT 




GCATCTGTCT 


GTCTGGCCTT 


TGTG~— 
TCTGCTTCTC 


GA G 

TTCTGATTCT 


CTGYyyCTSY 


GSAXSyGWYY 


SWMTGGCCWK 


TSTGCTTCTC 


TTCTGATTCK 


CTAATGCCGT 
CTTATTCTTT 


^-GAA 

CTACCCGGAC 


GCTATGTGCA 
TCTCTCTCTC 


TCTCTCTCTC 
TCTCTCTCTC 


TGGCCGTACC 
TCTCTCTCTC 


CTWATKCY3CT 


CTACCCGGAM 


KCTMTSTS^ 


TCTCTCTSTC 


TSKCYSTMYC 


TGGGT AA 

TCTCTCTCTC 


TAACCTGATC 
TCTCTCTCTC 








TCTCTCTCTC 


TCTCTCTCTA 


TATATATATA 


TSKSTCTCWM 


TMWCYYKMTC 


KMrrrcTCTC 


TCTCTCTCTA 


TATATATATA 


TATATATATA 


CACACACACA 


CACACACACA 


CACACACACA 


CACACACATA 


TATATATATA 


CACACACACA 


CACACACACA 


CACACACACA 


CACACACATA 


TATATTAGGG 


GGGGACTCCC 


TTTCTGCTCC 


ACCCTTGGGG 


AGCCCCTTGG 


TATATTAGGG 


GGGGACTCCC 


TTTCTGCTCC 


ACCCTTGGGG 


AGCCCCTTGG 


AACTGGTCCG 


CTCTGCTACC 


ACCACCCCCT 


GACCTCTCTC 


C AC C \. . w o\, 


AACTGGTCCG 


CTCTGCTACC 


ACCACCCCCT 


GACCTCTCTC 


CACCCCCGCG 


TTCACCTCCC 


CA 








TTCACCTCCC 


CA 









587 
596 
271 

600 



615 
646 
271 

650 



650 
696 
271 

700 



693 
746 
271 

750 



714 
796 
271 

800 



714 
84 6 
271 

850 



714 
896 
271 

900 



714 
946 
271 

950 



714 
958 

271 

962 



Intron 2 alignment 
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2A6 exon 3 
2A6 exon 3 
2A7 exon 3 

Consensus 



2A8 exon 3 
2A6 exon 3 
2A7 exon 3 

Consensus 



GCGTG^CfcT CAG Z AACI 
GCGTGGWTT CAGCAACGGG 
GCGTGG3CXT CAciiAACi 



'SGG SASCGCGCCA AGCAGCTCC^ IGCGCTTC^CC 
GAGCGCGCCA AGCAGCTCOT GCGCrrpqc: 
GAGCGCGCCA AGCAGCT C^T iGCGC 



2A6 exon 3 


ATCGCCACCC T 




2A6 exon 3 


ATCGCCACCC 1 




2A7 exon 3 


ATCGCCACCC 1 




Consensus 







qGqijsTGSGC AAGCCSpGCA TCSAGG. 
GTGGGC AAGCCsIgGCA 7CGAC 





Codon 160 



CATCCAGSAG 
CATCCAGGAG 
CATCCAGGAG 



:GGGCr TCCTCATCGA C GCC^CCGG 
CGGGCT TCCTCATCGA u GCq^3:GG 
TCCTCATCGA : GCC 




^rimer J51/61 




GCACGCACG 
GCACGCACG 
GCACGCACG 



50 
50 
50 

50 



IOC 
100 
100 

100 



ISO 
150 
150 

150 



Exon 3 alignment 
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2A8 Incron 3 
2A6 incron 3 
2A7 incron 3 

Consensus 



GTGAGCAGSS GACCCCGAGT Gi 
GTGAGTAAGG 7TCCCCGAGT GCi 

GTGAGYARGG KHCCCCGAGT 



1|GA4 

-GAC^GC^ 
-GAOAAGC^ ^ 




rCdqaSAGTG 

4;icrd 



lAGTu 



21 
44 
44 

50 



2A8 incron 3 
2A6 incron 3 
2A7 incron 3 

Consensus 



GAACC — CGC G Cn rTCTGCC TCBGGAT 
GAACC ZC CSC GC : 



aiAGG3CG-G 
CCAGCrCGFG 
CCAGGFCGFG GAACdCClCGC GCfc 



rrcT G Cc 
rrcTGCC 



lAT 
AX 



TG C GGAT 

tg:gg. 



GACTAutGj 
GACTAGGj 



68 
94 
94 

100 



2AB incron 3 
2A6 incron 3 
2A7 incron 3 

Consensus 



ggaaaggsgc ccgcacttcc 
ggaaaggigc ccgcacttcc 
ggaaagg:gc ccgcacttcc 



IrrsiianCTHnr mnr^r^r Mrrr^m^ 



i^SCCCTGG;^ z 
it:CCCTGG2)S 

JR F ccctgg;! \ 




117 
144 
144 

150 



2A8 incron 3 
2A6 incron 3 
2A7 incron 3 

Consensus 



2A8 incron 3 
2A6 incron 3 
2A7 incron 3 

Consensus 




SC CwTGCCXCCT GSAATTCXGA CTCTCCTqAG ACCTCTGAGI 
CCTGCCTCCT GGAATTCTGA CTCTCCTCAG ACCTCTGAGI 
CCTGCCT C CT GGAATTCTGA CTCTCCTCAG ACCTCTGAGT 



Primer B 



TGACTCTCTC CCCAACCCCC 
TGACTCTCTC CCCAACCCCC 
TGACTCTCTC CCCAACCCCC 




167 
194 
194 

200 



207 
233 
235 

241 



Intron 3 alignment 



FIG. 2 (Sheet 4) 



wo 95/34679 



12/29 



PCT/US95/07605 



1 2 3 4 5 6 7 





^300 bp 
<-200 bp 



FIG. 3 
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1 2 3 4 5 6 7 8 




<r-300 bp 
<-200 bp 



FIG. 4 
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12 34 5 678 9 

HHMHl— Ch-HH^-^- 

12 34 5 678 9 

-{HI— (H^— [^HH}-M 

12 34 5 678 9 

-{HI— H — [F— HHHl 



FIG. 8B 



CYP2A6 wild type 



CYP2A6V1 



CYP2A6V2 



wo 95/34679 



® 



PCT/DS95/07605 



19/29 



1 2 3 4 5 6 



Xcm I 



Ddel 



7.8 kb 




201 bp 
141 bp 

60 bp 





F4 



12 3 4 



6 7 8 9 



M— H 0 Hl-Mpi:^ CYP2A6 




Ode I Xcm I 



7.8 kb PGR 
product 



E3F 



E3R 



Exon 3 



201 bp 



141 bp 



59 bp 



142 bp 



60 bp 



CYP 2A6 wild type 
CYP 2A6v1 
CYP2A6V2 
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CYP2A6v2 cDNA. 

GACTGnaATGGTCTrcATCnUIGITIXjGCA 

GcixKxnoiGGGAaxAcaxiATnaoccnc^^ 

ACACAGACKZAGATXnACAACTXDCXnCATGAAGAT^ 

CXJ b ' lU ri lA OCATICAlCniGGGGOCXXXXX^^ 

ATGGOmiAaGGAGQCTCnGGlGGAailAGGCTC^^ 

GAGCAAGOCAGCTiaSACrGGGTCTrcAAAG^ 

AGDGGGAOCXXXSOCAAGCAGCTaCTGDGCm^^ 

TOGGC3GroGGC>WW30GAGGCATCX3AGGAGCX3^^ 

CTtMaSAGGOCATODGGAGCAOXAOD^^ 

CIGAGCXOCACACnCIXXAATGriOVT^^ 

TCACTATAAGGACAAAGAGrrCCIXjTCACKyiTC^ 

CTRXAGFTICACCnO^AraiOCACGGGC^ 

TGATGAAACACX3t3(X:AGGACCL\CAGCAAQ\^^ 

GCTGGAGCiACrrcATAGCXAAGAAGGKXi^^ 

TCXXAATTOXrAOGGGACITCVaTGACra 

AGQAGAAGAAOaXAAOVOQGAGITCTACTIGAAGAACxn^ 

ACGriX3AACCIXnTX:ATn3Q\GGCAO^^ 

TQGCnCITGCroCICATGAAGCAaXAGAGC^^ 

GAGATTGACAGAGTQATCXjGCAAGAACXGGCAGOCXj^GT^^ 

GCCAAGATCCXXTACATGGAGGCAGTGATCCACCjAGATXX^^ 

GACXjTGATCGCX^ATGAGriTCGCOCGCV^GAGrc^^ 

GGGATITCTianran"AAGGGCATAGAAGrGriaX^^ 

CraAGAGAOCrCAGGTICTIOaiV^^ 

OfGOGIXSAGAAGGGGCAGITrAAGAAaDgrGATGa^^ 

TCAGAAAGCGGAACimTIXXXSAGAAGGCCrcGCCAGAATC^ 

CTICnCACX:AO0GICATGCAGAACr^^ 

AGGACATIXL^ajTGrCXXm\AACACCJIXXXX^ 

rTAP AfY^Air^Araj i ■ 1 1 1 :i tiULrrxsati AGaiAGGGcnxmxxrK i iT^^ 

OITiGGOCXXinnCAGGGAAAGGGC^GGGCrAAGAaXK^^ 
r^GCTAAGACTC«3GGGCAGGATGG(^AAAGGAAGGGGCXnt3 C^^ 
AnGGAAGAnAAGAAACAOAAina^GCirAGTIXZACXTroATAAGCn^^ 
OAGCnXlGGATlGAGAGGAAGGAAACCCrrACATrATnCTATGAAGACT 
AATAATAGCAGCTCITATTTCCrGA 3 ' 



FIG. 11 
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1 AAGITCCCCT GAAAXMGGC TCTGCSTCTTC CTCCCCrTGC CA^TGA^ ^^^^ 
61 WgStCTAT GGCJU3CCATC CTGGCCTCAC TCTGAGGTTC CAATCSAGGAT ^CTQGQCATC 

iai G^OTTCTC GGASAACGCC GCTCQGCTTG CTRCACACTC CTCCTCCCAG **A«^CACA 
SSgCCC TGGSTCTTCC TACOTCGAC ACTTTCIVACT CCATATGCCT G^J^fCCC 

SSS^ StSSS SSSptat cT.m«:ccc ctcctaajtc 

SSS^ ^ScSt cctowustac cacagatto* gictgoaggc ccc^^ 

Ifii SSStGG GGCArerACT TGGSAGOTGA AAneACGTAA ■PTATGXAATC AGCCWlACTC 

i ^sss 

i ^ s?= ^ 

i"i oa«»«« «C«WVTCT 

-.-lii-i *^-nr-«wwa ARnATCeCAG CACATGACAT CTCGGTGCTG GGCCCCATTU MMaxwuw^ 

i^Jx SSSS JSS^ S^cTc 

1261 TlCACCaaTC ACTTGOMCX: CCGQCeCCSaPC GTGGnCCTGT GTCGACATCA TG^T^GG 
TGGACCACOC «»G^ 

"^""^^ SS^S sssss 

isot ISSS SSSS ?Sc^ tCCATG^^A T«CT«CCT 

g^XtotcCT gattcctccc tccctctctc tgccccacct ccitattctc 

llll TtSSaCOC STCTACAMA GTCTTTOftCG CCCTCTTACC '^^BQ^ 

SSS^ ScSSS dOGATCCCTT TCTCAATTCT TOCTCTGTCT 
laoi QGCTTATTCC TACTTCCACA TCTTCRGOT VCCATCTCCTG GTAACACTCT CTCPtCCTTC 
nil SSSJSS SSJSS CTCAATAilTA AACTCTCTGC ''^C^GCT^G 
1921 T^CtaAG AGACCMJGTC CTCCACCCAD ATCTCCCCAT ATCTCACW^ ^^l^^ 
nil S^SSS tSwcacic TCTTTCTCTC CCCACTGCSC CTGCGGA^ 
2041 S^GTCGAGC TAAITOCCGTC AAQCTATGIG CATCTCTCTG TCTCGCCOTA ^^^J 
IScc^^S ACTAGSCOIG CTATTCAGCA ACGGQGAGCG CGCCAAGCAG CTCCTOCOCT 
liSi JJSS^ SSgtc^ GACTTCGGGG WGGCAAGCG ACCCATCGAG 
nil ISSS^ «^ScC^ ATOauara TC^^ GCACGGTGAG cagggct^c 
SJi SSSSa GGAAAAOiCC CAGGACGJ^ SSSS 

SSSSqGG AcSgCTCTC GJOUkGGCKCC CGCACTTCCA GCCCICGAGT CTGGC6CTGG 
Soi SS^S? SLaAGGCC CtCCCTCCXO GAA^XGAC ^^CCT^ ^SSSS 
^ifil ^erCTCTCC CCAACCCCCT TCTCCCGACA taccccgrgg cgccaatatc CAggccaccT 

2Mi S^CAGXC TCCAATGTCA TCAGCTCCAT TGTCrTTG^ ^SS^S 

SSSa CAAAGAGTIC CTGTCACl^ tocccaogat GCTAGGAATC TJC^GTXCA 
2581 CTAATeCTTC CAC C CCGGCC CCTGAAGGCC CTTACCAAAA 

2641 CGTCAACCTC CACGGGOCAG CTAtow^ g^CM»AA TTCCCACCGC CCCCCGGACA 

"t S S ^ ^ ™ ™ 
ISl SS^SS S^SS oo=««<^ SS^SS 

llil ^SSSS SStccc^ taaacttxac 

3001 ATTXTAAfc^ lX«^fp*^m.« ACCTMCCAC CGGACACCAG ATGCCTTTAA CTCACHTCCT 

SSSa^ JSSSSS S^S^ cccgtgacag ctgtccttcc 
^^^^ 1^^^^^ ^P-^CTGCA accccrgctc txtoagatct tctcttcggt gatcaaacac 
^^^5^ SSSSS SSctJtSI tigctgcaag ggciggagga cttcaiaccc 
^I^SS^ SSS^ gScaS gatcccaao^ ccccacggga cttcatigac 

FIG. 12 (Sheet 1) 
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C*GGCAGAGG GWIATCAGTC ■reGGM?rGGG GCAGGCAGAT GACRCMGCC CMT^J^ 
3IS SSJSS? ATAATAXTCC TCftCAAl^ CTGOnCCCG TC^CTAAC^ ^SSiS^ 

JiSSSL. c^c=cca*G ^^J^T 

raei S= STcSS SS= j_c 
;«y.a**j«»G GCCCTGTcnc wu«u^ ^^SS JSSSiSi ^SJSttg 

raS =^ ^^s^ 

1= s= ^ 

4141 CCCTCTTGW: CCCCMTGGT CIBMCCrM, ^^^^^ SScSctC CCTTCRCGAG 
I2OI AT^STOCCCT GAGGTCA^ J^S^ SSSSS S^^Sc 
4261 GAGW^WVCC CCWkClVCGCA SSSSS I^StCTT ACTGCTCAIG 

4321 TTCATTOCAG «*CCG»^ ST^^^ SSS^ ISSgGGC CTCAGACCCT 
4381 AWXACCCM AOGTOOMSG TMCCCTBGA ^StoSg GWXC1«AGA 

4441 CAAAATTC^ SSSSSS SSSS S^SSS SSSSSS ^^AGTCOTXT 
45D1 CGTGACTTCC TCTCCjaftGX JStoG^CA GTCACTTCTG TCCCWVGCCC 

4561 TAGATATTi^ ^J^SSi ^SSS GTTCCTCCCT OTGCCTCCCC 

4621 ACTGAGTGCC CACTGCCCGT TCCft CUWW»x "^^^^^^ aCAATCCCSAA TCCGTGATGT 

=S ^ S — SSI IS^^ 

^^=1 

4921 CCCAAGTTTG AGGJ^CCGCXSC «iXvUii«ir TCAAAAAGGA CACCAAGTTT 

4981 AGRTTTQGAG ACGTGATCCC CXTGAGTTTC ^C^CCA^ SJ^S GGGGACTCCA 
nHAi gGf3GV ri ' l - C T TCCTCCCTJIA GCSTGCTAtCC GCCOCCACCC CCCAGftC T^ «t^!«v^r-iir- 
5041 CGQCa^TTCT r^KWWlCCC ACATTAGAAC CTTTCTAGAC CCTGTCCCAC 

5101 GCCCCTCTCT C5TGTCCCCAG CATCCCACtt J^^^^ CCOTTCCACC TTTCCACTTA 

IS ™ S 

is:™ sssss 

55B1 ATMaUUSTCT TCCC«lTGrP eGGCT^ SSJ^SS S^SaGAA (OCTGA'reCT 
5641 CGGQACTTCA ATCCCCAGCA CTTCCTGGST ^A^aAC^ SSS«C TACTCACACC 
5701 TTTGO^XCT TCT^^JJO ^J^^ SSSSS J^^SJ TTCCCCAGCT 
5761 AGCMQGGCC TCCCTTACCC ACTTCCMK Tf^^J^ TGATACTCCC TTAACTACCA 
S821 TGGOVACTTC etGTTAGCaA TCW^CGTTO JGffi^CJTO ^^^^^ tTCAGAGGCG 

5B81 Aocj^ jcc^ SISStca cac^ux^oat 

5941 ogogaaaacc aaakccaga cacagcaggt catatttogc agttcttatc 

6001 TCTTCAGCAT C^*AJAAG ^gj^^ S^CACCT GGCATCGATC AACCCCATCT 
6061 TGGGGGAhOO GGGATCTTAA 'fCTCra^ SSStcAA GGWXGTCAA GRGGCTCCCT 
6121 TTTGGTCATC TirrrGQGTCA ^CM^^ ^SJSS^ TG^GAGCC GCAGCTGGAG 

^^^It:^^ SSS gS^SSI SSSS? CTCCACCCCT cccgcctctc 

6241 GTCGCTACTG SttSgAK^ AGGCCTGCCC AGAATCCaVGC TCTTTCTCTT 

.6301 CTCCTCAGGA AAtSCGQAACT «»™^^^ ^IctCCTCC CAGTCACCTA AGGRCATTCA 
,361 CTTCACCACC ^JCATQ^GA J^JJ^ ^SSS SSSSLa TGACCTTCCT 

6481 GCCCCGCTGA O^AGGGCTC ^^^^ acCTAAGACT GGGGOCAGGA TGGCCGAAAG 
6541 GGQCCAACAC CC3GGCTTCGS ^^^^ JSaCaSaAC CGGCTCXCTT CACCTTGATA 
^^SSi ISSSSS ATTATOCTAT GAAGACr^ 

6661 AGGTGCTTCC ^^ff^^ ^i^^A^ TACCCCCGTG TCACCTTTGT TCAAAAACCA 
"II i^cSS SSSS cSS;SSS CCCTTCOAAG CGCCO^AT GCCCATrTTA 
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6961 CTATTCCTCA CGCA«AACftG J^^^^ SSiScX TGCCCACACT 

7021 GGACrrCCCC AGAGACCTGG GGGGTGGTTG CCCTCCCTTC cp^gvcaAGG 
7SS ?^*CT CAACAT.SCTG TGACTACCCS ^SSlSS SSSSs SS^S^ 
7141 CCACTOTAGC CCATTCACAG TCftGCXCAGG (aVCACIVACGX GACATOACTB Uftv^ 

7201 GTCftGTCCAT TAACAA. 
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CYP2A13 cDNA 

S- ATOOCTOA TGCKXXrr^^ 

QCTOinraXXxAOOCACXJOCATrGO^^ 

GKnTCAOCATICACTn3GGG000DGGCX 

GCOGfTCAAGGAGGCTCroGro^^ 

QCAGGGCAOCTItXjACrCXXnOTCAAAG^ 

GCXnGGGCAAGOXXKBCAlOGAGGAACGCATO^ 

AGCXXXIACAGflCTaiAAltJICAl^^ 

CT/OGAGGACAAAGAGTnxntn^ 

CACHTCACGGGAAOCnUZADGGGGCAGC^ 

GAAACAOOGOeAGGAaiACAGGAAC^^ 

GQAGGACriCAia3a::AAGAAGgItX3^^ 

AATiraXAGGGGACITCATOGACnrxnTrci^^ 

GAAGAAOPGCA^IACAGAGTrcr^^ 

GAAOCTCnCITIXXXK3QCACraAGA00arC^^ 

GACAGAGIGATOXSCAAGAADOGGCAGODCA^ 
ATGOOGTAC ACAGA GGCACjrGATCCAC^ 
ClXXXX:ATGGCnTTiaQOCX:ACAGGaiCAACAAG^ 
iUiicuiU XTAAGGGCA CIGAAGIGITCOCnAI^^ 

GAOCPCAGCjI lU l lUia ^AACXXXXj^GGACnXAGimXAGCAj Li i UUIU GAT 

GAGAAGG GGCAGnTA AGAAGAGiaATG CI 1 A lUlljt XXZTriTCCAItXSGA 

AAGCGGlALlUirilUQAGAAGGOCntjGOCAGAATGGA^ 

'TCAOCAOCATCATGCAGAACrrroGCT^^ 

A3XXACXjlbUUJULCAAACAOCntXKXT^^ 

CATGAGCriULlULXJUUGCTGAGGnAnfYY^ ^ 

OGCyXCVVGGGAAA0CXXXXXXX3CAGQ^^ 

AAOAATGGGGGCAGflPGfiGGAAry^AAr KyX^AGAGgroGTTAGAC^ 

qAAOAAACA GAACKKyXTItZAGnTCL^XTrnATGAT^^ 

ATGAGAGGAAGGGAAACCrrArAfT TATGCTArAAAGAnTAnTAATAATA 

GCAGCyCTTATgaaGA 3' 
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3421 CCTCCACTTC AGOVTCTTCA. CCAGCCCCAC TTTATACCTG AGCACCTGAA. CAAAAGCCCC 
3.181 CAATCCAGRC CCASTAAGTA TCTGGACAGC TGTCTCCAAC CAAGTCCACT TGAATCOTCA 
3541 AATACCTABA CSUWTGCCAC TCACCTCATA CCAGCCCCAC CTGAASAGCT AAACACCTGG 
36D1 ACAUCTCTCT TCCAACTCAA CTTCACTTGA ATATCTGAAC ACCTAGATGT GTGCTCCAAT 
3661 CCAGCCTCAT TTSCATACCT GAAACCTOGA TATAICCCTC AGfTrCTTCTC ACOTJAA^^ 
3721 CTAGAOCSTG CCCCTGGCAC CTAATCCACG TOAAAACTTA GATATAAQTT TttMTCCAAC 
3781 CCCACTCAAA TaCCTAAACA CCTGSACACA TGCCTTTAAC TCCGTTCCTT CCTTGCSATC 
3841 AAACAAAIXCC CCATTCCCAT CAGCICCTCC CCCGTGACAC CTOTCCTTCC CTTCCCATCC 
3901 TCTCTCTOCA ACCCCAGCTC TATGAGATGT TCTCTTC G GT GATCAAACAC CTGC CABGAC 
3961 CACAGCAACA CGCCTTOAAG GACCTGCAAG GGCTCGAGOA CTTCATCGCC AAGAACCBBG 
4021 ACCACAAnCTA GCOCACOCFG GAITCCCAATO CCCCACCGCA CTTCATCSAC TCCTTTCTCA 
4081 TCOXMGCA GGAGGTACAT CCCAGCAGCC AGTQCAGGCA OGTCCAAACC CASGGASASG 
4141 GAAASCCAieGA TCOGAGTGOC GfTSGQCAGAC GACACAGGCC CATTCAAATT AGCCCiraTC 
4201 ATAA-CAAICC rPACAATTCO CCAGOCOCQB TdCCTCATGA CCietAATCC OUaSACTTTG 
4261 GGAGCCCGAG GCAGGTGGAT CACCT6AGGT CAGGAGTTCC ^^^JJ^ S^SSSS 
4321 GTGAAACCCC GTCTOraCIA AAAATACAAA AATOAGCTAG CTATQOTGGC ATCOGCCTCT 
4381 AJWTCCCAGCT ACTCaCSAGG CTSACACAGA AGAATTTGTT TQAMCCGGG AGGCAGAMT 
4441 TGCAeiOASC CGGGA.TCATC CCACTGCACT CCGGCCTCAO ^«f«ej°CA ^g^JgA 
4501 AAAAAAAAAA AAAAAAAAAA AAAAAATTCC GGAAAACCCC AATmCAT^ 
4561 TCCCATCTAC XtSAGCCCTCA CCCACAAGGA CGGGTTATQG ACGTCGA TCA OMCTreWlJ^ 
4621 AACraCTCAA GAACXACCGG GTQCCACGAA CTG6GTTAAG miTTr AJGA TAfifTCCGCCA 
4681 IGGAACACTP TOAACAIMTC TTGAGGGAGG TTCACTCAT6 GCCCCAGTTO TACAAAaCAC 
4T41 OAAACTGAOQ CCCAGASACX TTAAGTGTCT TAACTGAGCT- CACAACAGX6 AOGM^CCA 
480X TGGTCCCCCT ACCTCAAACC C lWi V l S-IC TGAGCCTATA CCTCQTGCTT TTAGCOira 
4861 TCCTCTCWkA CCQTTCATGT CCIGOTTAQC AGACACACCT CTCa-G GACAG CTCACCIGGC 
4921 TTTACATTOC AOOGTCCCCG CCXACCTCTO OAltSPCAGCC TCCCATGTCG aA A»>>uay xA 
4981 GGAACCCAAA GCTCACSOaAS AAAGGATCAA GGGAGOSAOT CCTCCAOGT AAGTtTCAAG 
S041 ATTTTTACGG AAGAAKTAGG ATCCTGTTCC TTAAAATTCT tfltJc riGTAaf CTCACAAAAA 
5101 S^^^ ^ ^So ^ ? CIGACTCTTC ATCTTCCCAT CTCTGIACTA Cl - riViUi I C ePCTCCCCTC 
5161 ATCCTTCICT TTCCAAATAt TCCTATCATT AAAAAA6TAA CAOACTGGGA AACATQGCAA 
5221 AACCCCGTCT GIACMUUVAA ATGGCTACGC ATGGICGTGC ^-^^C^^ 
5281 TAAGGAISCSrr GAGGIGGCAG GATATCTtGA GCCCAGGGTG GSCACTCCTT -^^^^ 
5341 GATATCACAO CCCTOCCCTC CACCCTGGGT GACAGAATAA GACCGTGTCT CCCAAAAAAA 
5401 AAAACAAXTA ATrTTTEAAC AGTTAACAAG TSAGCCTGCA TACTCATOTO CATCTGCAGT 
5461 TCCACCTACT CTG6AGGCTQ AGACCGGAGG ATTCCTTGAA CCCAGGACTT G^^OTGC 
5521 CTOTGCAACT TAGCAAGACC AACTCTOCAT AAAAAAAAAA AAAACCAftCT QACAGC T AA a 
5581 TTGACAATTA AAEGATAGAT GATCAGTGAG OTAAAGAAGG TQAGAAGSAA GAfiCATTTT^ 
5641 GGCAAAGCCA GCAGCCAGGG CAACGGCTOG AACCTGQAGC GAQTTTGBCA A^^^^^^ 
S7fll CCCTCTTTCC ACCTTTGGTC TGGACCAAAG AGJUMTAGCT CCAAAGOAAA AGCCCTAGAA 
5761 GGGCCCCAAG AGCATCGAGA GXGAGCTTOG XCTAAACCGC OCTCtC^ SSJSS 
5821 AGAACCCCAA CACASAGTTC TACTTGAAGA ACCTQGXGAT GACCACCCTO AACCTCTTCT 
5881 TTCCGCOCAC TGAQACCCTG AGCACCACCC TGCOCTACGG TTTCCTQCTG CTCATGAAGC 
5941 ACCCAG&GCT GGADOeTAAG ACTQQAAACG GAGGAAACTO AAfiGCCCCCA OACCCTOUIA 
6001 ACTCCCCTGA OCCTOGIGO^ OXGiaiCCCAC CTATCCOUai TC^^ 
6061 CCTTGCrorC CAGAGACAGG ACAATATTCA GCTGATAGGC ATC AGCICAC TCTCATTATC 
6121 TATTAAAA1A ■rTGAAAATOT CTCCACTCAT TCtSTCAOrCA Cl-C CmrCCC AAfCC»CTG 
6181 AGICTCCGCT GCCTGCTCCT CTGGATCATC CCCTAAOPTC CTCCCITCIC t^CTCTG 
S241 ATTCTGACAC AACCTGCTTT AACAGGGATC CTCCTCCAAA CAATGCGAAT GGGTCATGTC 
6301 TTGTTOrTCT TTATGAATOO GCTTACCCCT CGTGTCAGAG GTCGAACCTA J^^^^ 
nil SiSSSg CXAGGGOGGG CGATACATGC CCTGCTCTAA «=»cccctaga ^sQ^ 
6421 ATATTCCCCT CCTCCGCCAO CCAAGCTCCA TGAGCAGATT GACAGAOTGA TCCGCAAGAA 
filfll CCGGCACCCC AAGTTTGAGG ACCGGGCCAA GATCCCCTAC ACACAGBCAG TGATCCACGA 
SS SSSS TCCTCCCCAT GC^^ 

6601 CAAOTTTCGG CATTTCTTCC TCCCTAABGT QCIGTCTCCC CTCOWWACC ACCACTCAQA 
61561 CTACGGGBAC WCCAGCCTC ■i ri ' Cmm i'C CCCAGAATCC TGCCCCCATT AGTGTTCTAG 
Bill SSer^ JScCOTCAA TCAGTCAAAA AAOACTTCCC CAACCACCAC ATCTGTTCCA 
Vrll SScScT TAGACASTCC TCAGTCCTGC ATCTCGCCAC ACTCTTTOTG TCAGSAGAAT 
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3421 CCTCCACTTC AGCATCTTCA CCAGCCCCAC TTTATACCTG XGCACCTGAJL CAAXAOCCCC 
3481 CAATCCAGAC CXIAGTAAGTA TCTGC3ACAGC TGTCTCCAAC CAAGTCCACT TGAATGCCTA 
3541 AATACCTAGA CACGTGCCAC TCACCTCATA CCAGCCCCAC CTCAAGAGCT AAACACCTGG 
3601 ACACCTCTCT TCCAACTCAA CTTCACTTGA ATATCTGAXC ACCTAGATGT GTGCTCCAAT 
3661 CCAGCCTCAT TTGCATACCT GAAACCTGGA TATATOCCTC AGTTCTTCTC ACCiaAATTA 
3721 CTAGACCGTG CCCCTGGCAC CTAATCCACG TCAAAACTTA GAT ATAAGIT XMMTCCAAC 
3781 CCCACTGAAA TACCTAAACA CCTOGACAGA TGCCTTTAAC TCCGTTCCTP CCTTGCTATG 
3841 AAACAAATCC CCATTCCCAT CAGCTCCT G C CCCGTOACAC CTC5TCCTTCC CTTCCCATCC 
3901 TCTCTCTGCA ACCCCAGCTC TATGAGATGT TCTCTTCGGT GATGAAACAC CTGCCAGGAC 
3961 CACAGCAACA GCCCTTTAAG GACCTCCAAG GGCTGGAGQA CTTCATCGCC AAGAACGTCG 
4021 AGCACAACCA GCGCACGCTG GATCCCAATT CCCCACGGGA CTTCATCCyiC TCCTTTCTCA 
4081 TCCGCATGCA GGAGGTACAT CCCAGCAGCC AGTGCAGGCA GOTGCAAACC CAGGGAGAGC 
4141 GAAATCA06A TGGGAGTGGC CTGGGCAGAC GACACAGQCC C ATTCA AATT AGCCCTCGTC 
4201 ATAATAATCC TTACAATTGG CCAGGCGCGQ TGCCTCATGA CCTCTAATCC CAGCACTOTG 
4261 GGA GG C CG AG GCACGTCGAT CACCTGAGGT CAGGAGTTCC AGACCAGCCT 
4321 GTGAAACCCC GTCTCiaCTA AAAATOkCAAA AATGAGCTAG CTATGGTGGC ATGCGCCTCT 
4381 AATCCCAGCT ACTCAGGAGG CTGAGACAGA AGAATTTGTT TQAATCCGGG AGGCA GAGC T 
4441 TGCAGrPGAGC CGQGATCATG CCACTGCACT CCGGCCTGAG TGACAGACCA AGACCC TGTA 
4S01 AAAAAAAAAA AAAAAAAAAA AAAAAATTCC GGAAAACCCC A ATTA CATCA CCCACTGCTG 
4561 TCCCATCTAC TGAGCCCTCA CCCACAAGGA CGGGTTATGG AGGTCGATTA GATICGAAAG 
4621 AACTTCTCAA GAACTACCGG GTGCCACCAA CTGGGTXAAG TGTTTT ATGA TAGTCCGCCA 
4681 TGOAACACTT TTAACACTTC TTGAGGGACC TTCACTCATG GCCCCAGTTG TACAAATGAG 
4741 GAAACTGAGG COCAGAGAGT TTAA G TGTCT TAACTGAGGT- CACAACAGTG AGGAAGACCA 
4801 TQGTCOOCCT AOCTCAAACC CrGG T C TCTC TCAGCCTATA GCrGGTGCTT TTAGCCACCA 
4061 T G CTCTCTAA CCCSTTCATGT CCTOGTTAGC AGACACACCT CTGTGGACAG CTGAC CTGGC 
4921 TTTACATTGC ACGCTCCCCG CCTACCTCTG GATCTCAGCC TCCCATGTGG GA AGGCTT TA 
4981 GGAAGCCAAA GCTCAGGGAG AAASGATCAA GGGAGGGATT CCTCCACAGT AAGTTTCAAG 
S041 ATTTTTAOGG AAQAAATAGG ATBCTGXTCC TTAAAArTCT GTGCrTGTAT CTIxaGAft^ 

5101 cTcrryri TT ctgactcttc atcttgccat ctctgtacta crrrcrcTTC gtctcccctc 
5161 atccttctct ttccaaatat tcctatcatt aaaaaagtaa cagactggga aacatggcaa 
5221 aaccccgtct gtacaaaaaa atggctaccc atggtgctgc at6ccpgcgg tcccagctac 
5281 taaggaggtt gaggtgqgag ga1atcttga gcccagggtg ggc agacgtt tcaatgaccc 

5341 GATATCACAG CCCTGCCCTC CA CC CTGGGT GACAGAATAA GACCGTGTCT CCCAAAAAAA 
5401 AAAAGAATTA ATTTTTTAAC AGXTAACAAG TGAGCCTGCA TAGTCATGTG CATGTGCAGT 
5461 TCCAGCTACT CTGSACCCTG AGACCGCAGG ATTCCTTGRA CCCAGGACTT GGAGTCCAGC 
5521 CTGTGCAACT TAGCAAGACC AAGTCTGCAT AAAAAAAAAA AAAACCAACT GACAG CTAAG 
5581 TTGACAATTA AAGBATAGAT CATCAGT6AC GTAAAGAAGG TG AGAAG GAA GAGCATTTTC 
5641 GGCAAAGCCA GCAQCCAGGG CAACGGCTQG AACCTGGAGC GAGTTTGGCA AATCTAGGGT 
5701 CCCTCTTTCC ACCTTTGGTC TG6ACCAAAQ AGAGCZPAGCT CCAAAGGAAA AGCCCTAGAA 
5761 GGGC C C C AAG AGCATGGACA GTQAGCTT GG TCTAAACCGC CLTlVrCCCTG CA GCAGQAGA 
5B21 AGAACCCCAA CACAGAGTTC lACTTOAAGA ACCTGGTGAT GACCACCCTG AACCTCTTCT 
5881 TTGC GCG CAC TGAGACCGTG AGCACCACOC TQCGCTACGG TTTCCTGCTG CTCATGAAGC 
5941 ACCCAGAGGT GGAGGGTAAG ACTGGAAACG GAOGAAAGTG AAGGGCCCCA GACCCTCAAA 
6001 ACTCCCCTGA C C CTGGTG CA G TOT A CCCAC CTATCCCAGA TCCCAGGACC CTGAGACGTG 
6061 C CT TGCTGrr C CAGAfiACAOG ACAATATTCA GCTGATAGGC ATCA GCTGAG TCTCATTAGC 
6121 TASTAAAATA TTCAAAATGT CTGCACTGAT TGGTCAGTCA CTCCTGTCCC AAGCCCACTG 
6181 AGTOTCCGCT GCCTGCTCCT CTGGATCATC CCCTAAGTTC CTCCCTTGTC CTACCCTCTG 
5241 ATTCTGACAC AACCTGGrTT AACAGGGATC CTGCTCCAAA CAATGCGAAT GGGTG ATGTC 
6301 TTGTTCTTCT TTATGAAIGG GCTTACCCTT CGTGTCAGAO GTGGAAGCTA XCTCAACCGC 
6361 CG TGTTCTAG CTAGGGOGGG CGATACATGC CCTGCTCTAA GACCCCTAGA GAGGGTAAAG 
6421 ATATTCCCCT CCTCCGCCAG CCAAGGTCCA TGAGGACATT GACAGAGTGA TCGGCAAGAA 
6481 CCGGCAGCCC AAGTTTGAGG ACCGGGCCAA GATGCCCTAC ACACAGGCAG TGATCCACGA 
6S41 GATCCAAAGA TTTGGAGACA TCCTCCCCAT GGGTITOCCC CACAGGCTCA ACAACGACAC 
6601 CAAQPTTCGG CATTTCTTCC TCCCTAAGGT GCTGTCTCCC CTCCACCACC ACC ACTCAG A 
6661 CTACGGGCSAC TTCCAGCCTC T C axrTGTCTC CCCAGAATCC TGCCCCCATT AGTCTTCTAG 
6721 ACTCTCTCCC ACTCCCTCAA TCAGTCAAAA AAGACTTCCC CAACCACCAC ATCTGTTCCA 
6781 CCrtTCCACT TAGACAGTCC TGAGTCCTGC ATCTCGCCAC ACTCTTTCTG TCAGGAGAAT 
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6841 ACACCCaTG TTCCCAATCT TCCTGTCTTA AGWlACAGWl GCCCCCTTO CMTaCG^T 
6503 TCn«GCTTJ«5 GGMZACAI^T CTCAGGTCCC TCAAACACCC ^^f^^ JJ^^S^- 
6961 CCATCTCTCC CAAACITCCT BTCTCJU3AGA CATGWACTr CXATCCCCCA **GCTCCT^ 
7021 CTCM»GGTC CCCA*«CCT CCXTOTCGTS CCACTCCCCG ^CT^ JSS^^ 
7081 CCCCTCQAGC CCCTGTGTW: •rrPCACCAAT CCCCCCAACC TGGCTCA.TAA CACRCACCTT 

cSSS SSgggcac T^AA^rcrnc cctatgct^b gctccgagct ^^^^ 

7201 MqSStCT CCJACCCCCA (WACTGCKW CCCCAGCACT TCCTGBATGA CAAfiGGGCRG 
73S1 TTTAAOAAOA OPGATGCTTT TOTCCCCTTP TCCATCGGTA AGAGACACTG TOTOCTGCCA 
•7321 GGCCaCOeCT CaCACCAGCA GGGGCCTCTC TCACCCACCT CCCCTCTCTG CGOTGTAGCC 
?3?1 SSStSct CCAGCTOSCa iUSTTC^ 

7441 TOCTCCCTTA ACTGCCAABC ACCCAATACC TQCGCCCAGG TAAAAGGGAA GGAAACATCT 
7S01 TTOCCOLTAC ATTTATTTOT CTAGGGTCAC ACAQCAOMT CTTCACCTCC CTCXAAAGGA 
7561 S^SotA CACCACAGCA GTCATaTTTG CAAI5TGTATC TGGGGIMTAG GOOCATCTAA 
^IS SSSx gSScSg GCAI6GATCA CCCCATCTAT GATGGMCCA TCACATTATG 
7681 CCTEOTTCGA AACCCATAia ACTOIATAAC ACACACISAAA CCCTAATCTA AACra^PG^ 
7741 TTTGOTTACT AATAATATAT CAATATWSGT TCACCATTCT TATATCTCTT A3AGAACGAA 
7801 ACTOAABCTC AGGQAGOATC GQAfiTCTCCT CTGAAABTCT CTCAGGCCAn" AATATTCCCA 
7B61 CCCCTCCTCC CTAOAGAGTO CAGCCGGGSO TCACISAGGGG TTGAGSCteC ACTGAGAGTG 
7921 CCCTTCACCT TCACCCCTCC TOCCTCTCCT CCTCA«5SAAA OCOGTACTGT ™CACAAG 
79B1 GCCT^S« AATGGAGCTC r i-i-L-l-CTTCT TCACCACCAT CATCCAGAAC TOTCGCTTCA 
BD41 AGTCCCCTCA GTCGCCTAAG GATATCC3ACO TOWCCCCAA ACACGTGGGC TTTGCCACGA 
iSSSS ?^CCATC AGCITCei«C CCCGCTGACC ^^'SS?^ 
SSl GCICeiOOGC GC^GCCAGGG AAACGCCC6G SJCAGGGGCO GWCTTCTGG 
B221 GCTAAQAATG GGGGCAGIGG GGGAAOaUUS GOCaAGAGQTG GCTAGASQSA ACAGAAGAAA 
nil SIaSSc TCACTTCACC TTGATGATCT CCTTCABAGC TOraATOAGA QQ*AG^ 
B341 CCTTACACSIiA -tCCTACAAAO AOTAGTAATA ATAfiCAOCTC TTATCTCCTG AACAACSTCre 
B401 TCCCTOTCAB CTTronCAA AAAOCOMQC ACCCTCACCT CACTTATTTC CCACACACCT 
B461 cSSSto CGGAAAAGTC TTCATTCCCC TTTTTACACG TGAGAAAG«3r GCCCCTCAGA 
BS21 AAQTTCTCTC TATCTCAAAA OICACAAAAC 6CAASTGTCC ACAGGMCTT QGAACACASA 
8581 TCTCGGCCCA TAGCCCTCTA ttATCGATCCT CACCATABCA CCCOTCTO ACOXWUJITA 
B641 GCTTAGTATA GCA1CACATO GCCTGAACAC CCCTGGGCOO GGGaCTTCCC ^^^^^ 
8701 GCCSeGCGGCT GCCCTGCOTA CTCTGTACRC TCGCCTACTC GGGRCGRTCC GGGCACCAGG 
8761 CTGTCACCTG AGCTCGCTA 



FIG. 14 (Sheet 3) 



wo 95/34679 



2 9/29 



PCT/US95/07605 



^ US 




0.01 






J» 


oo 


o 


c\irw 


tn 


cocsi 




1 ■ 


1 




